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CHAPTER 1 



Conventions, Notation, Reminders 



1.1 Mathematical Conventions 

Vectors will be denoted in boldface, x, E, or with a Latin subscript Xj, Ei, 
i = 1, 2, . . . , d. Unless otherwise specified, we will work in d = 3 dimensions. 
Occasionally, we will use Greek subscripts, e.g. j^, /x = 0, 1, . . . , d where the 
0-component is the time-component as in = (t,x,y,z). Unless otherwise 
noted, repeated indices are summed over, e.g. ciibi = ai&i+a2&2+03&3 = a b 
We will use the following Fourier transform convention: 



1.2 Plane Wave Expansion 

A standard set of notations for Fourier transforms does not seem to exist. 
The diversity of notations appear confusing. The problem is that the nor- 
malizations are often chosen differently for transforms defined on the real 
space continuum and transforms defined on a real space lattice. We shall 
do the same, so that the reader is not confused when confronted with varied 
choices of normalizations. 




(1.1) 
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1.2.1 Transforms defined on the continuum in the interval 

Consider a function f(x) defined in the interval [— L/2, L/2] which we wish 
to expand in a Fourier series. We shall restrict ourselves to the commonly 
used periodic boundary condition, i. e., f(x) = f(x + L). We can write, 

Because the function has the period L, q must be given by 2im/L, where the 
integer n = 0, ±1, ±2, .... Note that n takes all integer values between — oo 
and +oo. The plane waves form a complete orthogonal set. So the inverse 
is 

1 f L / 2 

f q = — dxe^f(x). (1.3) 

\JL J -L/2 

Let us now take the limit L — > oo, so that the interval between the 
successive values of q, Aq = 2n/L then tend to zero, and we can convert the 
(/-sum to an integral. For the first choice of the normalization we get 

f(x) = Km VI p-f q e^, (1.4) 

L^oo 2TT 



and 



/oo 
dxe iqx f(x). (1.5) 
-oo 



If we define f(q) = lim^oo \/Lf q , everything is fine, but note the asymme- 
try: the factor (l/2n) appears in one of the integrals but not in the other, 
although we could have arranged, with a suitable choice of the normalization 
at the very beginning, so that both integrals would symmetrically involve a 
factor of (1/V2tt). Note also that 

lim LS q , q , -► 2Ti5{q - q). (1.6) 

L^oo ' 

These results are simple to generalize to the multivariable case. 

1.2.2 Transforms defined on a real-space lattice 

Consider now the case in which the function / is specified on a periodic 
lattice in the real space, of spacing a, i. e., x n = na; x^/2 = L/2, x_ N / 2 = 
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—L/2, and Na = L . The periodic boundary condition implies that f(x n ) = 
f(x n + L). Thus, the Fourier series now reads 

1 

Note that the choice of the normalization in Eq. (1.7) and Eq. (1.2) are 
different. Because of the periodic boundary condition, q is restricted to 

2itm 

« = w <L8) 

but the integers m constitute a finite set. To see this note that our complete 
set of functions are invariant with respect to the shift q — ► q + G, where the 
smallest such reciprocal vectors, G, are ±(2ir/a). Thus the distinct set of 
q's can be chosen to be within the 1st Brillouin zone —(it/a) < q < (ft/a); 
accordingly, the distinct set of integers m can be restricted in the interval 
—N/2 < m < N/2. Therefore the number of distinct q's is equal to N, 
exactly the same as the number of the lattice sites in the real space. What 
about the orthogonality and the completeness of these set of plane waves? 
It is easy to see that 



N 

n=0 ° 



(rt>n =NS »j2?*!L s{q _^ (L9) 



Note the consistency of Eq. (1.6) and Eq. (1.9). The completeness can be 
written as 

= N6 n , . (1.10) 

q&lstBZ 

In the limit that iV —>■ oo, this equation becomes 

— 

a % lqXn = S nfi . (1.11) 

il 2tt a 

a 

The integration runs over a finite range of q, despite the fact that the lattice 
is infinitely large. Why shouldn't it? No matter how large the lattice is, the 
lattice periodicity has not disappeared. It is only in the limit a — > that we 
recover the results of the continuum given above. To summarize, we started 
with a function which was only defined on a discrete set of lattice points; 
in the limit iV — > oo, this discreteness does not go away but the set [q] 
approaches a bounded continuum. The function f g is periodic with respect 
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to the reciprocal lattice vectors, i.e., the entire q space can be divided up 
into periodic unit cells, but clearly not in an unique manner. 
Finally, the inverse Fourier series is given by 

f q = a^2e-^f(x n ). (1.12) 

n 

In the limit TV — > oo, 

f(x n ) = J\ ^ Xn U (1-13) 

a 

f q = aJ2e- iqXn f(x n ). (1.14) 

The prefactor a in front of this sum is actually the volume of the unit cell 
in real space. You can now generalize all this to three dimensions and work 
out the consequences of various normalizations. 

1.3 Quantum Mechanics 

A quantum mechanical system is defined by a Hilbert space, TC, whose vec- 
tors are states, \ip). There are linear operators, Oi which act on this Hilbert 
space. These operators correspond to physical observables. Finally, there 
is an inner product, which assigns a complex number, (xlV'); to any pair of 
states, \ip), |x). A state vector, gives a complete description of a system 
through the expectation values, {tp\Oi\tp) (assuming that \ip) is normalized 
so that (ip\ip) = 1), which would be the average values of the corresponding 
physical observables if we could measure them on an infinite collection of 
identical systems each in the state \ip). 
If for all vectors \ip) and |%), 

(XW> = <#%>*• (1-15) 

then the operator C is the Hermitian adjoint of O and will be denoted 
by 0t. Here c* is the complex conjugate of the complex number c. The 
notation follows Dirac and tacitly uses the dual vector space of bras {("01} 
corresponding to vector space of kets { | if>) }. Although the introduction 
of the dual vector space could be avoided, it is a very elegant and useful 
concept. Just see how ugly it would be if we were to define the scalar 
product of two vectors as (|x) , |^)) = , |x))*. 
An Hermitian operator satisfies 



o = e> f 



(1.16) 
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while a unitary operator satisfies 

OO t = O t O = l (1.17) 

If O is Hermitian, then 

e i0 (1.18) 
is unitary. Given an Hermitian operator, O, its eigenstates are orthogonal, 

(A'|0|A) = A(A'|A) = A'(A'|A) (1.19) 

For A + A', 

(A'|A) = (1.20) 

If there are n states with the same eigenvalue, then, within the subspace 
spanned by these states, we can pick a set of n mutually orthogonal states. 
Hence, we can use the eigenstates |A) as a basis for Hilbert space. Any state 
|Y>) can be expanded in the basis given by the eigenstates of O: 

M=2>|A> (1.21) 

A 

with 

cx = (A|V>>. (1-22) 

The Hamiltonian, or total energy, which we will denote by H, is a partic- 
ularly important operator. Schrodinger's equation tells us that H determines 
how a state of the system will evolve in time. 

ih^) = H\iP) (1.23) 

If the Hamiltonian is independent of time, then we can define energy eigen- 
states, 

H\E) = E\E) (1.24) 
which evolve in time according to: 

\E(t))=e-*f\E(0)) (1.25) 

An arbitrary state can be expanded in the basis of energy eigenstates: 

\^) = Y J C i \E i ). (1.26) 
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It will evolve according to: 



. E 



|Y^)) = J>e-Hh|^.). (1.27) 

3 

The usual route for constructing the quantum mechanical description of 
a physical system (Hilbert space, inner product, operators corresponding to 
physical observables) leans heavily on the classical description. The classical 
variables p, q are promoted to quantum operators and the Poisson bracket 
relation [p, <?]p.b. = 1 becomes the commutator of the corresponding opera- 
tors p, q: [p, q] = —i. Hilbert space is then constructed as the representation 
space for the algebra of the operators p, q. The theory is then "solved" 
by finding the eigenstates and eigenvalues of the Hamiltonian. With these 
in hand, we can determine the state of the system at an arbitrary time t, 
given its state at some initial time to, according to (1.26) and (1.27). This 
procedure is known as canonical quantization. 

Let us carry this out explicitly in the case of a simple harmonic oscillator. 
The solution of the harmonic oscillator will be useful preparation for the Fock 
space construction of quantum field theory. 

The harmonic oscillator is defined by the Hamiltonian, 

H = ^io(p 2 +q 2 ) (1.28) 
and the commutation relations, 

[p,q]=-i (1.29) 
We define raising and lowering operators: 



a = (q + ip) /v 2 
a) = {q- ip) jyfi 

(1.30) 

The Hamiltonian and commutation relations can now be written: 

H = uj I a* a + \ 



2 j 

[a,a)} = l (1.31) 

We construct the Hilbert space of the theory by noting that (1.31) implies 
the commutation relations, 



[H, a)} = uoa ] 
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[H,a] = -ua (1.32) 

These, in turn, imply that there is a ladder of states, 

Ha)\E) = {E + u) <J\E) 
Ha\E) = (E — u) a\E) (1.33) 

This ladder will continue down to negative energies (which it can't since the 
Hamiltonian is manifestly positive definite) unless there is an Eq > such 
that 

a\E ) = (1.34) 

To find Eq, we need to find the precise action of a, a) on energy eigen- 
states \E). From the commutation relations, we know that cfi\E) oc \E + u>). 
To get the normalization, we write (v\E) = ce\E + uo) . Then, 

\c E \ 2 = {E\aa)\E) 

= E+% (1.35) 

Hence, 



a?\E) = Je+^\E + lu) 
a\E) = ^E-^\E-u) (1.36) 

From the second of these equations, we see that o\Eq) = if Eq = u>/2. 

Thus, we can label the states of a harmonic oscillator by their integral 
a) a eigenvalues, |n), with n > such that 

H\n) = oj (n + ^ \n) (1.37) 

and 

a}\n) = \/n+ l\n + 1) 

a\n) = yf^\n - 1) (1.38) 

These relations are sufficient to determine the probability of any physical 
observation at time t given the state of the system at time to. 

In this book, we will be concerned with systems composed of many parti- 
cles. At the most general and abstract level, they are formulated in precisely 
the same way as any other system, i.e. in terms of a Hilbert space with an 
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inner product acted on by operators corresponding to observables. However, 
there is one feature of this description which is peculiar to many-particle sys- 
tems composed of identical particles and has no real classical analog: Hilbert 
space must furnish an irreducible representation of the permutation group 
acting on identical particles. We will briefly review this aspect of quantum 
many-particle systems. 

When we have a system with many particles, we must now specify the 
states of all of the particles. If we have two distinguishable particles whose 
Hilbert spaces are spanned by the bases 

\i,l) (1.39) 

where i = 0, 1, . . . are the states of particle 1 and 

|o:, 2) (1.40) 

where a = 0,1,2,... are the states of particle 2. Then the two-particle 
Hilbert space is spanned by the set: 

|t,l;a,2) = <g> |a,2) (1.41) 

Suppose that the two single-particle Hilbert spaces are identical, e.g. the 
two particles are in the same box. Then the two-particle Hilbert space is: 

\i,j) = \i,l)®\j,2) (1.42) 

If the particles are identical, however, we must be more careful because \i,j) 
and \j, i) must be physically the same state, i.e. 

\i,j) = e ia \j,i). (1.43) 

Applying this relation twice implies that 

\i,j)=e 2ta \i,j) (1.44) 

so e ia = ±1. The former corresponds to bosons, while the latter corresponds 
to fermions. The two-particle Hilbert spaces of bosons and fermions are 
respectively spanned by: 



and 



(1.46) 
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The n-particle Hilbert spaces of bosons and fermions are respectively spanned 
by: 

X)mi)>--->m«)) ( 1 - 47 ) 

IT 

and 

^(-in M i),...,M„)> (i.48) 

7T 

Here it denotes a permutation of the particles. In position space, this means 
that a bosonic wavefunction must be completely symmetric: 

1p(xi, ■ ■ ■ i Xj, . . . , Xj, . . . , Xn) = l/j^Xl, . . . , Xj, ■ ■ ■ , Xj, ■ ■ ■ , x n ) (1.49) 

while a fermionic wavefunction must be completely antisymmetric: 

Ipfal , . . . , Xj,, . . . , Xj , . . . , Xn) — lf){x\ , . . . , Xj , . . . , Xj, . . . , X n ) (1.50) 

1.4 Statistical Mechanics 

The concept of partition function is central to equilibrium statistical me- 
chanics. For a canonical ensemble that we shall frequently use, it is given 
bjZ, 

Z = Y,e~ (SEn - (1-51) 

n 

where the temperature of the ensemble, T, is 1/ksP, and ks is the Botzmann 
constant. Here E n are the energy eigenvalues of the Hamiltonian. Given the 
partition function, the macroscopic properties can be calculated from the 
free energy, F, 

F = ~]nZ. (1.52) 

To make sure that a system is in equilibrium, we must make the scale of 
observation considerably greater than all the relevant time scales of the 
problem; however, in some cases it is not clear if we can reasonably achieve 
this condition. 

Alternatively, we may, following Boltzmann, define entropy, S, in terms 
of the available phase space volume, T(E), which is 

S = k B lnT(E). (1.53) 

But how do we find T(E)? We must solve the equations of motion, that is, 
we must know the dynamics of the system, and the issue of equilibration 
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must be addressed. In contrast, in the canonical ensemble, the calculation 
of the partition function is a counting problem. 

The Boltzmann formula can be reconciled with the ensemble approach 
of Gibbs. We must determine T(E). In general, this is impossible without 
computing the trajectory of the system in the phase space. The recourse is 
to assume that T(E) is the entire volume of the phase space allowed by the 
conservation laws. No matter how complicated the motion may be, if the 
system, in the course of time, visits every point in the phase space, all we 
need to do is to calculate the measure in the phase space corresponding to 
the conserved quantities. It is convenient to introduce quantum mechanics 
at this step to simplify the argument. According to quantum mechanics each 
point in the phase space corresponds to a quantum state. So, we simply have 
to count the number of states, and we write 

T(E) =Y,$(E-E n ). (1.54) 

n 

Equation (1.53), combined with Eq. (1.54), defines the microcannonical en- 
semble of Gibbs. But in "deriving" it, we did not have to invoke the notion 
of an ensemble. 

We can go further and ask what would happen if we replaced the above 
formula by the following: 

T'(E) = Y,e-^ En ~ E) , (1-55) 

n 

where (3, for the moment, is an unknown positive number. You can show 
that the entropy defined by T'(E) leads to the same thermodynamics as the 
one defined by T(E), provided (3 = 1/ksT. We have now arrived at the 
cannonical ensemble. This is curious; in Eq. (1.54) we only sum over states 
of energy E, but in (1.55) we seem to sum over all states. The reason for this 
miracle is the extensive nature of E and S. They are of order N (~ 10 23 ). 
Consequently, the sum is so sharply peaked that practically all the weight 
is concentrated at E. Now, Eq. (1.55), combined with Eq. 1.53, leads to the 
same thermodynamics as you would obtain from a canonical ensemble. 

Although the ensemble approach is quite elegant and convenient, uncrit- 
ical use of it can be misleading. Suppose that you are given a Hamiltonian 
which has two widely separated scales, a very fast one and a very slow one. 
If the observation scale is longer than the shorter time scale, but smaller 
than the longer time scale, the slow degrees of freedom can be assumed to 
be constant. They cannot wander very much in the phase space. Thus, in 
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calculating the relevant volume of the phase space we must ignore the slow 
degrees of freedom, otherwise we would get an answer that will not agree 
with observations. 

A simple well known example of two distinct time scales is the problem 
of ortho- and para-hydrogen. The spins of the nuclei in a hydrogen molecule 
can be either in a triplet state, or in a singlet state. The interaction between 
the nuclei is negligible and so is the interaction between the nuclei and the 
electronic spins that are in a singlet state. Thus, the ortho-para conver- 
sion takes time, on the order of days, while the momenta of the molecules 
equilibrate in a microscopic time scale. Therefore, the number of nuclei in 
the singlet state and the number of nuclei in the triplet state are separately 
constants of motion on the time scale of a typical experiment, and the free 
energies of these two subsytems must be added rather than the partition 
functions. Experimental observations strikingly confirm this fact. 

When there are a few discrete set of widely separated scales, it is easy to 
apply our formulae, because it is clear what the relevant region of the phase 
space is. There are instances, however, where this is not the case, and there 
is a continuum of of time scales, extending from very short microscopic scales 
to very long macroscopic scales. The common amorphous material, window 
glass, falls into this category. If glass is to be described by a Hamiltonian, 
it is not sufficient to know all the states and sum over all of them; we must 
examine the actual dynamics of the system. Glass is known to exhibit many 
anomalous thermal properties, including a time dependent specific heat. 
In this respect, the Boltzmann formula, Eq. (1.53), can still be used. In 
principle, we could calculate the actual trajectories to determine the volume 
of the phase space sampled during the observation time. There is no need to 
use the hypothesis that T(E) is the total volume allowed by the conservation 
laws. Of course, as far as we know, this formula is a postulate as well and 
is not derived from any other known laws of physics. 

We still have to understand what we mean by an ensemble average when 
experiments are done on a single system. The ensemble average of an ob- 
servable O is defined to be 



(0) = TrpO, 



(1.56) 



where the density matrix p is given by 




n 




(1.57) 



n 
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It is more likely, however, that an experiment yields the most probable value 
of (9, that is, the value shared by most members of the ensemble. However, 
the distribution of the members in the ensemble is so strongly peaked for a 
macroscopic system that roughly only one member matters; fluctuations are 
insignificant in the thermodynamic limit defined by ./V — > oo, V — > oo such 
that p = y is a given number. The relative fluctuations in O is given by 



<o»-<o» 



<G> 2 \JN 



which is insignificant when N ~ 10 23 . Thus, the most probable value is the 
only value, hence the mean value. 

Another useful partition function is the grand canonical partition func- 
tion Zq defined by, 

Z G = Tre-M H -» N \ (1.59) 

In this ensemble, the number of particles is not fixed, and the system is 
assumed to be in contact with a particle bath as well as a heat bath. In 
the definition of the trace one must also include a sum over a number of 
particles. The average number of particles is determined by the chemical 
potential p. It is convenient to think of chemical potential as a "force" and 
the number of particles as a "coordinate" , similar to a mechanical system in 
which a force fixes the conjugate coordinate. As in mechanical equilibrium, 
in which all the forces must balance, in a statistical equilibrium the chemical 
potentials for all the components must balance, that is, must be equal. It is 
also possible to give a similar interpretation to our formula for the canonical 
ensemble where we can take the temperature as the "force" and the entropy 
as the corresponding "coordinate". For the grand canonical ensemble, we 
define the grand potential, S7: 

Q = -^\nZ G 

= F-fiN. (1.60) 

All thermodynamic quantities can be calculated from these definitions. 

Actually, we could go on, and define more and more ensembles. For 
example, we may assume that, in addition, pressure P is not constant and 
define a pressure ensemble, in which we add a term —PV in the exponent. 
For every such extension, we would add a "force" multiplied by the corre- 
sponding conjugate "coordinate". We could also consider an ensemble in 
which the linear momentum is not fixed etc. 
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The definition of the free energies allow us to calculate various thermo- 
dynamic quantities. Since 

F = E-TS (1.61) 



and 
we get 
Then, 



dE = TdS - PdV + /idN, (1.62) 
dF = -SdT - PdV + fidN. (1.63) 



<9T 



V,AT 



T,AT 



(1.65) 



dN 



T,V 



Similarly, from the definition of the definition of the thermodynamic poten- 
tial 0, we can derive the same relations as 
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CHAPTER 2 



Phonons and Second Quantization 



2.1 Classical Lattice Dynamics 

Consider the lattice of ions in a solid. Suppose the equilibrium positions of 
the ions are the sites R{. Let us describe small displacements from these 
sites by a displacement field u{Ri). We will imagine that the crystal is just 
a system of masses connected by springs of equilibrium length a. 

At length scales much longer than its lattice spacing, a crystalline solid 
can be modelled as an elastic medium. We replace u(Ri) by u(r) (i.e. we 
replace the lattice vectors, Ri, by a continuous variable, r). Such an approx- 
imation is valid at length scales much larger than the lattice spacing, a, or, 
equivalently, at wavevectors q <C 2w/a. 




Figure 2.1: A crystalline solid viewed as an elastic medium. 
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The potential energy of the elastic medium must be translationally and 
rotationally invariant (at shorter distances, these symmetries are broken to 
discrete lattice symmetries, but let's focus on the long-wavelength physics 
for now). Translational invariance implies V[u + uq] = V[u], so V can only 
be a function of the derivatives, diUj. Rotational invariance implies that it 
can only be a function of the symmetric combination, 

There are only two possible such terms, UijUij and u\ k (repeated indices are 
summed). A third term, Ukk, is a surface term and can be ignored. Hence, 
the action of a crystalline solid to quadratic order, viewed as an elastic 
medium, is: 



dtd 3 r£ = i J dtd 3 f p(d t Ui) 2 - 2puijUij - Xu 2 kk (2.2) 



where p is the mass density of the solid and p and A are the Lame coefficients. 
Under a dilatation, u(r) = af, the change in the energy density of the elastic 
medium is a 2 (X + 2///3)/2; under a shear stress, u x = ay, u y = u z = 0, it is 
a 2 p/2. In a crystal - which has only a discrete rotational symmetry - there 
may be more parameters than just p and A, depending on the symmetry 
of the lattice. In a crystal with cubic symmetry, for instance, there are, in 
general, three independent parameters. We will make life simple, however, 
and make the approximation of full rotational invariance. 

2.2 The Normal Modes of a Lattice 

Let us expand the displacement field in terms of its normal-modes. The 
equations of motion which follow from (2.2) are: 

pd 2 Ui = (p + A) didjiij + pdjdjUi (2-3) 

The solutions, 

u i {r,t)=e i e i ^-" t ) (2.4) 
where is a unit polarization vector, satisfy 

-puo 2 ti = - (p + A) ki (kj€j) - pk 2 ei (2.5) 
For longitudinally polarized 



J k = ± ^±A k = ±Vlk (2.6) 
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while transverse waves, kjCj = have 

ul = ±J^k = ±v s k (2.7) 
Hence, the general solution of (2.3) is of the form: 

Ul (f, t) = Ey^f e * (n/^-^ + 4,. e " i(£-f, " wSt) ) (2 - 8) 



k,s 



s = 1, 2, 3 corresponds to the longitudinal and two transverse polarizations. 
The normalization factor, 1/ y / 2pu>f,, was chosen for later convenience. 

The allowed k values are determined by the boundary conditions in a 
finite system. For periodic boundary conditions in a cubic system of size 
V = L 3 , the allowed fc's are ^ (rai, ri2, n^). Hence, the A?-space volume 
per allowed k is (2tt) 3 /V. Hence, we can take the infinite-volume limit by 
making the replacement: 



J d 3 kf(k) (2.9) 



k 

V 



(2vr) 3 

It would be natural to use this in defining the infinite- volume limit, 
but we will, instead, use the following, which is consistent with our Fourier 
transform convention: 

u .(f t )- ! V 1 e s (V e^-" 3 ^ + a t ~i(k-r-^ k t)\ 



(2.10) 



2.3 Canonical Formalism, Poisson Brackets 

The canonical conjugate to our classical field, Ui, is 

dC 

d(d t Ui) 



The Hamiltonian is given by 

H = J d 3 fiTidtUi — C 
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%Ui) + 2/j.UijUij + Xu 



kk 



= 5 / fir 



- 7T? + 2flUi jUij + Au^ fc 



(2.12) 



Let us define the functional derivative, 

dT dT 



6_ 

5rj 



J d 3 fT(r) 



drj d(dii]) 
Then the equation of motion for 7Tj can be written 

8H 



(2.13) 



dm = - 



5m 



while 



dm 



(2.14) 



(2.15) 



From these equations, we see that it is natural to define the Poisson 
brackets: 

f d 3 f f SU6v SU 5V\ 
J \Sui S-Ki 5-Ki 5ui J 

With this definition, 

[uj(f)M?)]pB = 8 (?-■?) (2.17) 

and 



d t Ui = [ui,H] PB 



(2.18) 



As we will see shortly, the normalization chosen above for the normal 
mode expansion of Uj(r) is particularly convenient since it leads to: 



,t 



-i S, q i 5 



(k-k 1 ) 



(2.19) 



When we quantize a classical field theory, we will promote the Poisson 
brackets to commutators, [, ]pb — ► i [>]■ 
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2.4 Motivation for Second Quantization 

The action (2.2) defines a classical field theory. It has 3 degrees of freedom 
per spatial point - i.e. it has infinitely many degrees of freedom. This is a 
consequence of the continuum limit which we took. A real finite-size sample 
of a solid has a finite number of degrees of freedom: if there are ./V ions, there 
are 3N degrees of freedom, f±, r-z, . . . , r/v- However, it is extremely conve- 
nient to take the continuum limit and ignore the difference between 3N and 
oo. Furthermore, we will also be concerned with the electromagnetic field, 
E = — V</3 — dtA, B = V x A, which does have infinitely many degrees of 
freedom (2 per spatial point when gauge invariance is taken into account). 
By going to the continuum limit, we can handle the electromagnetic field 
and an elastic medium in a parallel fashion which greatly facilitates cal- 
culations. We thereby make a transition from classical particle mechanics 
(with a discrete number of degrees of freedom) to classical field theory (with 
continuously many degrees of freedom) : 

r a u(x , t) 
t «-► t 

a^x (2.20) 

At the quantum level, we will be dealing with wavefunctionals of the form 
^[^(r)] or ^[yl(r)] rather than ip(ri,r2, ■ ■ ■ , r^). The coordinates r are no 
more than indices (but continuous ones) on the fields. Hence, the operators 
of the theory will be u(r), dtu{f) or A(r), dtA(r) rather than f a ,p a . 

In this approach, the basic quantities will be the normal modes of the 
displacement field, rather than the ionic coordinates. As we will see be- 
low, the collective excitations of an elastic medium are particle-like objects 
- phonons - whose number is not fixed. Phonons are an example of the 
quasiparticle concept. In order to deal with particles whose number is not 
fixed (in contrast with the ions themselves, whose number is fixed), we will 
have to develop the formalism of second quantization. 1 

2.5 Canonical Quantization of Continuum Elastic 
Theory: Phonons 

2.5.1 Review of the Simple Harmonic Oscillator 

No physics course is complete without a discussion of the simple harmonic 
oscillator. Here, we will recall the operator formalism which will lead natu- 
rally to the Fock space construction of quantum field theory. 
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The harmonic oscillator is defined by the Hamiltonian, 

1 
2 



H =\u{p 2 +q 2 ) (2.21) 



and the commutation relations, 

[p,q]=-i (2.22) 
We define raising and lowering operators: 



(2.23) 



a = (q + ip) / v2 
a) = (q — ip) j\Pl 



The Hamiltonian and commutation relations can now be written: 

r 



H = oj I a^a + - 



The commutation relations, 



(2.24) 



[H, a)} = ua) 

[H,a]=-ua (2.25) 

imply that there is a ladder of states, 

Ha)\E) = (E + u)a*\E) 
Ha\E) = {E-uj)a\E) (2.26) 

This ladder will continue down to negative energies (which it can't since the 
Hamiltonian is manifestly positive definite) unless there is an Eq > such 
that 

a\E ) = (2.27) 

Such a state has Eq = tu/2. 

We label the states by their a^a eigenvalues. We have a complete set of 
H eigenstates, |n), such that 

H\n) =uj(n+^\\n) (2.28) 

and (at) n |0) oc \n). To get the normalization, we write a'|n) = c n \n + 1). 
Then, 

\cn\ 2 = (n|aa^|n) 
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= n + l (2.29) 

Hence, 

a*\n) = \Jn + l|n + 1) 

a|n) = V"|n-l) (2-30) 

2.5.2 Fock Space for Phonons 

A quantum theory is made of the following ingredients: 

• A Hilbert Space H of states \ip) G H. 

• Operators Oi on H, corresponding to physical observables. 

• An Inner Product (xlV'} which must be defined so that Oi is Hermitian 
with respect to it if the corresponding physical observable is real. 

In order to construct these objects for an elastic medium - thereby quan- 
tizing our classical field theory - we employ the following procedure. We 
replace the classical variables, Ui, 7Tj by quantum operators satisfying the 
canonical commutation relations: 

[Ui (x, t) , Uj (x, t)] = [iTi (X, t) , TTj (x,t)] = 

[u{ (x, t) , iTj (x , i)] = i Sij5 (x — x') (2-31) 
We can now define the operators g , at according to: 

%s = \<y/Wi J («i(*,0) + ^ft«i(*,0)) 

a { s = rtV^4 I («i (*><>) - ^(*,0)) e-^(2.32) 

These expressions can be inverted to give the normal-mode expansion, (8.72). 
Using 7Tj = pd t Ui, and the above commutation relations, we see that g and 

at satisfy the commutation relations: 

k,s J 



1 r77 « s> f d 3 x r d 3 



x 



(27r) 3 /2 7 ( 27 r) 3 /2 

(^(f,0) + -L e 1 ^, (uj (x',o) - -^c^ 
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,,/„„/ f d 3 x f d 3 x' 



( 



i 



Ui (x,0), rrdtUj (x',0 

) 



+ 



2 y u k u k' e t e j 
= 5 ss ,5(k-k') 

We can similarly show that 



d t Ui (x,0) ,Uj (x',o) 

d 3 x f d 3 x' fj_ _L_\ / -a 
(2vr)W (2vr)3/2 ^2 + "Zj 3 ^ * ' 



ik-x—ik'-x' 



at , at 



(2.34) 



We can re-write the Hamiltonian, iJ, in terms of ar and at by substi- 



k,s 



tuting (8.72) into (2.12). 



k,s 



H = 



+ a kA,s { p ^ 2 + vk 2 + 5 sl (n + \) k 2 ) 
+ 4,«°M {P^'kf + + <5 s i ( /x + A ) k 2 ) 



(2.35) 



Hence, the elastic medium can be treated as a set of harmonic oscillators, 
one for each k. There is a ground state, or vacuum state, |0) which satisfies: 

a k , s |0> = (2.36) 

for all k, s. The full set of energy eigenstates can built on the vacuum state: 

(4 1 , 1 ) ni ( a L S2 ) n2 ---«,J nj 10) (2-37) 

The Hilbert space spanned by these states is called Fock space. We demand 
that the states of the form (2.37) are an orthogonal basis of Fock space, 
thereby defining the inner product. The state (2.37), which has energy 



(2.38) 



(2.33) 
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can be thought of as a state with n\ phonons of momentum k\ and polar- 
ization si; ri2 phonons of momentum ki and polarization si\. . . ; n j phonons 
of momentum kj and polarization Sj. The creation operator at creates 
a phonon of momentum ki and polarization Sj while the annihilation oper- 
ator ajfc 1)S1 annihilates such a phonon. At the quantum level, the normal- 
mode sound-wave oscillations have aquired a particle-like character; hence 
the name phonons. 

You may have observed that the above Hamiltonian has an infinite con- 
stant. This constant is the zero-point energy of the system; it is infinite 
because we have taken the continuum limit in an infinite system. If we go 
back to our underlying ionic lattice, we will find that this energy, which is 
due to the zero-point motion of the ions, is finite. The sum over k really 
terminates when u>k is the Debye energy. For the most part, we will not be 
interested in this energy (see, however, the problem set), so we will drop it. 
This can be done by introducing the notion of a normal-ordered product, 
which will be useful later. The normal-ordered product of a set of a) k _ s .'s 

and at.. s .'s is the product with all of the at „ 's to the left and all of the 
afc^s/s to the right. It is denoted by a pair of colons. For example, 

'• a ki,s 1 a \ :i ^ 1 a k2,S2 := a i ljSl a fci,si a fc2,S2 (2.39) 

Since creation operators commute with one another and annihilation oper- 
ators do as well, we do not need to speficy their orderings. Hence in the 
above example, the ordering of afe 1;Sl and afc 2jS2 above is unimportant. The 
normal ordered product can be defined for any free fields, i.e. for any fields 
which can be expanded in creation and annihilation operators with time- 
dependence of the form (8.72). Suppose A is such an operator. Then we can 
always write A = A^ + A^ where A^ is the part of the expansion of A 
which contains positive frequencies, e lu)t and A^ is the part which contains 
the negative frequencies. Normal-ordering puts the A^ + ^'s to the left and 
the A(~^s to the right. If we define the quantum Hamiltonian to be : H :, 
then we eliminate the zero-point energy. 

The divergent zero-point energy is the first of many ultra-violet diver- 
gences which we will encounter. They occur when we extend the upper limit 
of k- integrals to infinity. In fact, these integrals are always cutoff at some 
short length scale. In most of the problems which we will be discussing in 
this course, this cutoff is the inverse of the lattice scale. In the above ex- 
ample, this is the wavevector corresponding to the Debye energy. When we 
turn to electrons, the cutoff will be at a scale of electron volts. When field 
theory is applied to electrodynamics, it must be cutoff at the scale at which 
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it becomes unified with the weak interactions, approximately WOGeV . 
2.5.3 Fock space for He 4 atoms 

We can use the same formalism to discuss a system of bosons, say He 4 
atoms. This is particularly convenient when the number of He atoms is not 
fixed, as for instance in the grand canonical ensemble, where the chemical 
potential, /j,, is fixed and the number of particles, N, is allowed to vary. 
Suppose we have a He 4 atom with Hamiltonian 



H 



p2 

2m 



(2.40) 



The energy eigenstates \k) have energies and momenta 

H\i) _ * |S) 
P\k) = k\k) 

They are orthogonal: 

(k'\k) = 5{k-k') 
If we have two particles, we have eigenstates \k\, k 2 ) with 

P\h,k 2 ) = (fci + fca) \h,k 2 ) 

satisfying 

(h, k 2 \k 3 , fe 3 ) = 5(fei - fc 3 ) S(k 2 - h) + <J(/si - fc 4 ) <5(fc 2 - fe 3 ) 



(2.41) 
(2.42) 



(2.43) 



(2.44) 



We can continue in this way to 3-particle, 4-particle,. . . , etc. states. There 
is also a no-particle state: the vacuum, |0). 

The Hilbert space spanned by all of these states is Fock space. We can 
define creation and annihilation operators a£, a& satisfying: 



i a k> a k>] 



6(k - k') 




4' 4 



(2.45) 



so that 



al\0) 
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i^,fc 2 )=4 i 4 i |o) 

etc. (2.46) 

Writing 

H = J d 3 kuj k atag 
d 3 kk 

a \ a k (2.47) 



where uj^ = k 2 /2m, we see that we recover the correct energies and momenta. 
From the commutation relations, we see that the correct orthonormality 
properties (2.42) (2.44) are also recovered. 

We can now construct the fields, ip(x), ^(x): 

ip(x) satisfies the equation: 

In other words, suppose we view the Schodinger equation as a classical wave 
equation - analogous to the wave equation of an elastic medium (2.3) - which 
can be derived from the action 

S-JdtW^ + l^)* (2.50) 



Then, we can second quantize this wave equation and arrive at the Fock 
space description. 
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CHAPTER 3 



Perturbation Theory: Interacting Phonons 



3.1 Higher- Order Terms in the Phonon Lagrangian 

The second quantization procedure described in the previous chapter can 
be immediately applied to any classical field theory which has a Lagrangian 
which is quadratic in its basic fields. However, most systems have La- 
grangians with higher-order terms. For instance, there are certainly terms in 
the phonon Lagrangian which we have neglected which are cubic, quartic, 
and higher-order in the displacement fields, u^. An example of a phonon 
Lagrangian with such a term included is 



We use this phonon Lagrangian as an illustrative example; it is not intended 
to be a realistic phonon Lagrangian. 

Classically, the presence of such terms means that different solutions can 
no longer be superposed. Hence, there is no normal mode expansion, and 
we cannot follow the steps which we took in chapter 2. When g is small, we 




(3.1) 



The Hamiltonian corresponding to (3.1) is: 




= H + H' 



(3.2) 
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can, however, hope to use perturbation theory to solve this Hamiltonian. In 
this chapter, we develop a perturbation theory for H' using the solution of 
Hq presented in chapter 2. As we will see, higher-order terms in the phonon 
Lagrangian lead to interactions between the phonons which cause them to 
scatter off each other. 

In order to facilitate the construction of the perturbation theory, we 
will need several technical preliminaries: the interaction picture, the time- 
ordered product, and Wick's theorem. 

3.2 Schrodinger, Heisenberg, and Interaction Pic- 
tures 

In the Schrodinger picture, states evolve in time according to: 

i§f. W))s = H (t) W))s ( 3 - 3 ) 

while operators are time-independent unless they have explicit time depen- 
dence. For example, if we have a particle in ID, p and x do not depend 
on time, but we can switch on a time-dependent driving force in which case 
the Hamiltonian, H(t) = p 2 /2m + x coswt, is time-dependent. The time- 
evolution operator, U (i, t') acts on states in the following way: 

\m)s = U{tJ)W))s (3-4) 

It satisfies the equation 

i§-U(t,t') = H(t)U(t,t') (3.5) 

subject to the initial condition, U(t,t) = 1. If H is time-independent, then 

U(t,t') = e-^- t,)H (3.6) 
In the Heisenberg picture, on the other hand, states are time-independent, 

\m) H = \m) s = \m) H m 

while operators contain all of the time-dependence. Suppose Os(t) is an 
operator in the Schrodinger picture (we have allowed for explicit time de- 
pendence as in H(t) above). Then the corresponding Heisenberg picture 
operator is: 

H (t) = [/(0,t)O 5 (t)(f/(0,t)) t (3.8) 
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Finally, we turn to the interaction picture, which we will use extensively. 
This picture can be defined when the Hamiltonian is of the form H = 
Hq + H' and Hq has no explicit time-dependence. The interaction picture 
interpolates between the Heisenberg and Schrodinger pictures. Operators 
have time-dependence given by Hq: 

O I (t) = e itH »O s {t)e- itH0 (3.9) 

This includes the interaction Hamiltonian, H' which now has time-dependence 
due to Hq: 

H z (t) = e itHo H' s (t) e- itHo (3.10) 

(We will drop the prime and simply call it Hj.) The states lack this part of 
the time-dependence, 

\m)i = e itHo \m)s (3-ii) 

Hence, states satisfy the differential equation 



= e itH °(-H + H s )\m)s 
= e itHo H' s (t) e~ itH ° \i>{t))j 

= H I (t)W))i ' (3-12) 

We can define an Interaction picture time-evolution operator, Ui(t,t'), 
satisfying 

i- Ui(t, t') = H^U^t 1 ) (3.13) 
which evolves states according to 

= £mm')i^')>, (3.i4) 



3.3 Dyson's Formula and the Time-Ordered Prod- 
uct 

If we can find Ui(t, t'), then we will have solved to full Hamiltonian Hq + H', 
since we will know the time dependence of both operators and states. A 
formal solution was written down by Dyson: 



Uj(t,t') = r{ e - l />"^")} 



(3.15) 
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where the time- ordered product, T, of a string of operators, Oi(ii)02(i2) • • • r , 
is their product arranged sequentially in the order of their time arguments, 
with operators with earlier times to the right of operators with later times: 

r{Oi(ti)o 2 (t 2 ) • • • o n (t n )} = o^ujOiSi,) ■ ■ ■ o in (t in ) 

if t h >t i2 > ■■■ >U n (3.16) 

There is some ambiguity if ti = tj and 0(ti) and 0(tj) do not commute. In 
(3.15), however, all of the Oj's are Hj, so we do not need to worry about 
this. 

To see that it satisfies the differential equation (3.13), observe that all 
operators commute under the time-ordering symbol, so we can take the 
derivative naively: 

.d 
i- 



dt 



T = T [H^e-^" 11 ^} (3.17) 



Since t is the upper limit of integration, it is greater than or equal to any 
other t" which appears under the time-ordering symbol. Hence, we can pull 
it out to the left: 



d 



r{ e -i#*" = Hj(t)T{e-^' dt " H ^} (3.18) 

With Dyson's formula in hand, we can - at least in principle - compute 
transition amplitudes. For example, let us suppose that we have a system 
which is in its ground state, |0). Suppose we perform a neutron scattering 
experiment in which a neutron is fired into the system with momentum P at 
time t' and then interacts with our system according to Hi. The probability 
(which is the square of the amplitude) for the system to undergo a transition 
to an excited state (1| so that the neutron is detected with momentum P' 
at time t is: 

(^P'lUj^t'MP)^ (3.19) 

Of course, we can rarely evaluate U[(t,t') exactly, so we must often 
expand the exponential. The first-order term in the expansion of the expo- 
nential is: 

-i / dtxH^t-i) (3.20) 
Jt> 

Hence, if we prepare an initial state \i) at t' = — oo, we measure the system 
in a final state (f\ at t = oo with amplitude: 

/oo 
dtH!(t)\i) (3.21) 
-oo 



3.4. WICK'S THEOREM 
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Squaring this, we recover Fermi's Golden Rule. There is a slight subtlety 
in that the t integral leads to an amplitude proportional to 8{Ei — Ef). 
This appears to lead to a transition probability which is proportional to 
the square of a 5-function. We understand, however, that this is a result of 
taking the limits of integration to infinity carelessly: the square of the 5- 
function is actually a single 5-function multiplied by the difference between 
the initial and final times. Hence, this implies that the transition rate is: 



with one 5-function dropped. 

To get a sense of the meaning of the T symbol, it is instructive to consider 
the second-order term in the expansion of the exponential: 



j dh J dt 2 T -(#j(ti)ffj(t2)) = H) J dh J dt 2 H I {t 1 )H I {t 2 ) 

(3.23) 

3.4 Wick's Theorem 



We would like to evaluate the terms of the perturbation series obtained by 
expanding Dyson's formula (3.15). To do this, we need to compute time- 
ordered products T{Hj Hf . . . Hj}. This can be done efficiently if we can 
reduce the time-ordered products to normal-ordered products (which enjoy 
the relative simplicity of annihilating the vacuum) . 

To do this, we define the notion of the contraction of free fields (remember 
that, in the interaction picture, the operators are free and the states have 
complicated time-dependence), which we will denote by an overbrace: 




(3.22) 





i4(ti)S(t 2 ) = T (A(h)B(t 2 )) - : A(ti)B{t 2 ) : (3.24) 

Dividing A and B into their positive- and negative-frequency parts, 4^, 
we see that: 



A(t 1 )B(t 2 ) = \a(-\B^] (3.25) 



if ti > t 2 and 



A(h)B{t 2 ) 



b(-\aM 



(3.26) 



if t\ < t 2 . This is a c- number (i.e. it is an ordinary number which com- 
mutes with everything) since [a, <r] = 1. Hence, it is equal to its vacuum 
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expectation value: 



A(ti)B(h) = (0\A(ti)B(h)\0) 

= (0\T(A( tl )B(t 2 ))\0) - (0\:A( tl )B(t 2 ):\0) 

= (0\T(A(t 1 )B(t 2 ))\0) (3.27) 

The following theorem, due to Gian-Carlo Wick, uses the contraction to 
reduce time-ordered products to normal-ordered products: 

T{u 1 u 2 ... u n } = : mu 2 ... u n : 

+ : u±U2 ■ ■ ■ u n : + other terms with one contraction 

+ : ui^u^U4 . . . u n : + other terms with two contractions 

+ : uyw} ■ ■ ■ y^iu^ ■ 

+ other such terms if n is even 
+ :u\u 2 ... u n - 2 u n -\ u n : 

+ other such terms if n is odd (3.28) 

The right-hand-side is normal-ordered. It contains all possible terms with 
all possible contractions appear, each with coefficient 1. The proof proceeds 
by induction. Let us call the right-hand-side w{u\ u 2 . . . u n ). The equality 
of the left and right-hand sides is trivial for n = 1, 2. Suppose that it is true 
for time-ordered products of n — 1 fields. Let us further suppose, without 
loss of generality, that t\ is the latest time. Then, 

T {u\ u 2 ... u n } = u\ T {u 2 ... u n ] 
= uiw (u 2 , ...,u n ) 

= W (U2, ...,U n ) + l4~ ) W (U 2 , ...,U n ) 

= n^ +) w (u 2 , ...,u n ) + w (u 2 , ... ,u n ) + 



= w (u!,u 2 , . . . ,u n ) (3.29) 

The equality between the last two lines follows from the fact that the final 
expression is normal ordered and contains all possible contractions: the first 
two terms contain all contractions in which u\ is not contracted while the 
third term contains all contractions in which u± is contracted. 

A concise way of writing down Wick's theorem is the following: 

T{ Ul u 2 ... u n } =: e^ l ' 3=1 1 3 ^ 9 ^ u 1 u 2 ...u n : (3.30) 



3.5. THE PHONON PROPAGATOR 
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3.5 The Phonon Propagator 

As a result of Wick's theorem, the contraction of two phonon fields, UiUj, 
is the basic building block of perturbation theory. Matrix elements of the 
time-evolution operator will be given by integrals of products of contractions. 
The contraction v^Uj is also called the phonon propagator. In the problem 
set, you will compute the propagator in two different ways. First, you will 
calculate it directly from: 

(T{u i {x 1 ,t 1 )u j (x2,t 2 ))} = [y + \tij(->] (3.31) 

You will also calculate it by noting that 

(T (ui(xi, ti)uj(x 2 , t 2 ))} = 0(ti-t 2 ) (ui(xi,ti)uj(x 2 , t 2 ))+9(t 2 -ti) {uj(x 2 ,t 2 )ui(xi, 

(3.32) 

and acting on this with (2.3) to obtain, 

(pS ik d 2 - (n + \)did k - fidikdidi) (T (u k (xi, h)uj(x 2 , t 2 ))) 
= -iS (xi - x 2 ) 5 (ti - t 2 ) 5ij (3.33) 

By Fourier transforming this equation, we find: 



1 f d 

(T (ui (xi ,ti)uj(x 2 ,t 2 ))) = - J 



d 3 p <ko jWte-f.u^ft^t,)) %e i e . 



(p-(xi-xi)-w(ti-t 2 )) i 3 

) 3 2vr " uj 2 - (u s v ) 



e 



2 



(3.34) 

Here, we have used efej = Sij. However, the singularities at lo 2 = (tOp) 2 
are unresolved by this expression. As you will show in the problem set, the 
correct expression is: 



(T(u (xi U)u-(xo to))) - - I — e i ff(*i-a»)-"( f i-fa)) ^ 

[1 { Ul { Xl M)u 3 [,x 2 ,t 2 ))) - p J (2 ^3 27r e u*-(u>)* + i6 

(3.35) 

Since ejki = k while e 2,3 ki = 0, 



l i _ 

k 2 

^ + 4z) = &* - ^ (3.36) 



1 2 3 

Hence, using uj^ = vip, uj p ' = v s p, we can rewrite the phonon propagator 
as: 

(T (ui(xi, h)uj(x 2 , t 2 ))) = - [ Wtem^w-w ihkj/k* 

p J (27T) ,:1 27T uj 2 - vfp z + id 



38 



CHAPTER 3. PERTURBATION THEORY: INTERACTING 

PHONONS 



l r d 3 p du i{f{Sl _ Sl) _ w{tl _ t2)) i (Sjj - kjkj/k 2 ) 

pj (2ir) 3 2ir to 2 -v 2 p 2 + iS 

(3.37) 

For some purposes, it will be more convenient to consider a slightly 
different phonon field, 

^ *) = / (0r < + 4,. e " i( * v_wSt) ) ^ 

The difference with m is the missing \j ^J2uj s k . This field has propagator: 

(T(u-(x, U) U (xo to))) - - f ^ ^ J(P<ZlSl)-"(tl-t2)) 2lU P ^3 

= - f dZpdcue^-^-"^-^ ( K ^ + ^ ) 
pj \uj-ujs + id uj + uj s p -idj 

(3.39) 

3.6 Perturbation Theory in the Interaction Pic- 
ture 

We are now in position to start looking at perturbation theory. Since trans- 
verse phonons are unaffected by the interaction (3.1), we only need to discuss 
longitudinal phonons. Consider the second-order contribution in our theory 
of phonons with a quartic anharmonicity (3.1), 

(— i) 2 f°° f°° 
l/(-oo,oo) = ^- / dh dt 2 T (Hjit^Hjfo)) 

^- J — OO J — CO 



2! 



J d 3 xidt 1 d 3 x 2 dt 2 T {{d k u k (xi, h)) 4 (d k u k (x 2 ,t 2 )) 

(3.40) 



When we apply Wick's theorem, we get such terms as: 



1 d 3 xidt 1 d 3 x 2 dt 2 : d k u k d k u k d k u k d k u k (x 1 ,t 1 ) d k u k d k u k d k u k d k u k (x 2 ,t 2 ) : 



2! 

(3.41 

This term will contribute to such physical processes as the scattering be 
tween two longitudinal phonons. If we look at 

(&3, l;k±,l;t = oo\U(— oo, oo)\k±, l;k 2 ,l;t = — oo) = 
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f-zo/4!) 2 f , o • " * 

... H - / (Fxidti(Fx2dt2 d k u k d k u k (x 1 ,t 1 )d k u k d k u k (x2,t 2 ) x 

(k 3 ,l;k4,l;t = oo\ : d k u k d k u k (xi, h) d k u k d k u k (x2, t 2 ) : |fci, Z; fc 2 , Z; i = -oo) 
+ . . . (3.42) 

this will give a non- vanishing contribution since two of the uncontracted Uj's 
can annihilate the phonons in the initial state and the other two can create 
the phonons in the final state. Let's suppose that the incoming phonons 
are annihilated by d k u k ^ d k u k ^ at {x\,t{) and the outgoing phonons are 
created by the d k u k + ^d k u k + ^ at (x 2 ,t 2 ). Since 

d k u k - ) d k u k -\x 1 ,t 1 )\k 1 J-,k 2 ,ht = -oo) = - \h\ \k 2 \ (e^-<^ e *(fe-*i-«L a *i)) | ) 

(3.43) 

we obtain a contribution to (3.42) of the form: 

(-^/4!) f d 3 x 1 dt 1 (fx2dt2 {\ki\ \k 2 \\k 3 \ \k 4 \ x 



2! 

^ e i((ki+k2)-3i-(w l kl W k2 )ti)^ ^ e -i((k3+k4)-x 2 -(u> l k:} W k4 )t 2 )^ 



d k u k (xi,ti) d k u k (x 1 ,t 1 )d k u k (x2,t 2 ) d k u k (x 2 ,t 2 )} (3.44) 



Substituting the expression for u k (xi,ti)u k (x2,t2), we find: 

H§£2! / A 2d(2 |ft £ {|5l| |iEl| x 

^i((ki+k2-pi-p2)-xi-(u> l ki +u> l k2 -u>i-u)2)ti)^ ^-i{(kz+ki-pi-p2)-X2-(u)\, 3 +ui l ki -uJi-ui2)t2)^ 

P u( - (iJjJ 2 + iS p u{ - (u4) 2 + i5 

The x and t integrals give 5 functions which enforce momentum- and energy- 
conservation. 

F2IF3I |« 4 | x 



2! 7 (2tt) 3 2^ (2tt) 3 2tt 

(2vr) 3 (5(fci + fc 2 - Pi ~ fht) 2ir6(w l kl + J k2 - wi - w 2 ) 
(2vr) 3 (5(4 + fei - pi - pb) 27r<5(4 3 + 4 4 - wi - w 2 ) 

— N 2 — / / \ 2 , • r N 2 — / , x 2 1 ( 3 - 46 ) 

P w{ - ((J pi ) 2 + 18 w$ - (uj 1 ^) 2 + 16 
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which, finally, gives us 

N ' - 2 -«) 2 + ^ + fe2 " ^ K+<--i) 2 -K 1+fe2 - P1 ) 2 + ^ 

(2vr) 3 <5(fc 1 + fc 2 - fc 3 - 27rS(u l kl + < - ^ - <)} (3.47) 

There are actually several ways in which an identical contribution can 

be obtained. By an identical contribution, we mean one in which there are 

/ * > 

two contractions of the form u k (x*i, ti)u k (x*2, fa)', the incoming phonons are 
annihilated at the same point (which can be either (xi,ti) or (x2,t 2 ) since 
these are dummy variables which are integrated over); and the outgoing 
phonons are created at the same point. The incoming phonons are annihi- 
lated by the Uj's at (x±,ti) and the outgoing phonons are annihilated by the 
itj's at (x2,t2), which can be done in (4 • 3)(4 • 3) ways. There are 2 ways 
in which we can choose how the remaining Uj's at (xi,h) are contracted 
with the remaining Uj's at (x2>*2)> giving us a multiplicity of (4 • 3) (4 • 3)2. 
It is now clear why we included a factor of 1/4! in our definition of g: the 
above multiplicity almost cancels the two factors of 1/4!. Only a factor of 
1/2 remains. If we permute (xi,t\) and (x2,t 2 ), then the incoming phonons 
are annihilated by the Uj's at (£2,^2) and the outgoing phonons are anni- 
hilated by the itj's at (xi,ti). This gives an identical contribution, thereby 
cancelling the 1/2! which we get at second-order. Hence, the sum of all such 
contributions is: 

(-ig) 2 1 f d 3 p\ du\ 



7 1 J2^^ {NNNI^ 

l^ll 2 ~2 / / , t 1^1 + ^2 - Pi' 2 



- 2 -KJ 2 + ^ li ™ K+<-^) 2 -K + , 2 - P1 ) 2 + ^ 

(27T) 3 5(h +k 2 -h- k 4 ) 2ir8{J kl + J k2 - 4 3 - J kA )} (3.48) 

There are, of course, other, distinct second-order contributions to the two 
phonon — > two phonon transition amplitude which result, say, by contracting 
fields at the same point or by annihilating the incoming phonons at different 
points. Consider the latter contributions. Ther is a contirbution of the form: 

(—ig) 2 1 f d 3 pi du>\ 



l^ll 2 ~2 / / \ O ■ r \h ~ h -fl' 2 



uj - «) 2 + i6 1 ™ {J ki - J k3 - u,) 2 - (u l ki _ k3 _ pi ) 2 + iS 
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{2ir) 3 S(k 1 +k 2 -k 3 - lU) 2tt<5(u4 + J k2 - 4 3 - 4J} 



(3.49) 



and one with k% — > £4. 



The cancellation which we obtained by permuting the different (xj,ij)'s 
does not always occur. For instance, the following contraction at second- 
order makes a contribution to the amplitude for the vacuum at t = —00 to 
go into the vacuum at t = 00: 



(0; i = oo\U(— 00, oo)|0; i = — 00) = 




x 2 dt 2 d k u k (xi,ti)d k u k (x2,t 2 )d k u k (xi,ti)d k u k (x2,t2) x 



d k u k d k u k d k u k d k u k (0;t = oo|0;i = -00) 
+ ... 



(3.50) 



We have written this term with contracted fields adjacent in order to avoid 
clutter. There are no distinct permutations, so there is nothing to cancel the 
1/2!. In addition, there are only 4! ways to do the contractions, so there is 
an uncancelled factor of 1/4! as well, and hence, an overall factor of l/2!4!. 
Consider, for a moment, how this works at n th order. There will be a factor 
of n\. If this is incompletely cancelled by permutations of the (xj, tj)'s, there 
will be a factor, l/S (in the above, S = 2). In the next chapter, we will 
see that the symmetry factor, S, is related to the symmetries of Feynman 
diagrams. In addition, there will be factors arising from the incomplete 
cancellation of the (l/4!) n . In the above, this additional factor is 1/4!. 

Again, there are other second-order contributions to the vacuum-to- 
vacuum amplitude which result from contracting fields at the same point, 
but they will give a different contribution which is different in form from 
the one above. One such is the following: 

(0;t = oo|LT(-oo,oo)|0;t = -00) = 

••• + ' J d 3 x 1 dt 1 d 3 X2dt2 d k u k (xi, h)d k u k (xi, h) d k u k (x 2 , t 2 )d k u k (x 2 , t 2 ) x 

, * v , * V 

d k Uk(xi,ti)d k Uk{x2,t2) dkUk{xi,ti)d k Uk(x2,t2)(0;t = oo|0;t = -00) 



+ ... 



(3.51) 



There are 4 • 3/2 ways of choosing the two fields at (x±,ti) which are con- 
tracted and 4-3/2 ways of choosing the two fields at (a^,^) which are 
contracted. Finally, there are 2 ways of contracting the remaining fields 
at (xi,ti) with those at (x2,t2)- This multiplicity incompletely cancels the 
l/2!(4!) 2 . 
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As another example, consider 
H<?/4!) 2 



d xidti d x 2 dt 2 : d k u k d k u k d k u k d k u k (xi,t{) d k u k d k u k d k u k d k u k (x 2 ,t 2 ) 

(3.52) 

This contributes to the amplitude for processes in which both the initial and 
final states contain one longitudinal phonon. There are 4-4 ways of choosing 
the Ui's which create the incoming phonon at (x±,ti) and annihilate the 
outgoing phonon at (x 2 ,t 2 ). There are 3! ways of contracting the remaining 
tij's. Finally, (xi,ti) and (x 2 ,t 2 ) can be permuted. This gives an overall 
factor of 4 • 4 • 3! • 2, which incompletely cancels the (1/2!) • (1/4!) 2 , leaving 
1/3!. 



CHAPTER 4 



Feynman Diagrams and Green Functions 



4.1 Feynman Diagrams 

Feynman introduced a diagrammatic notation which will help us system- 
atically enumerate all of the perturbative contributions which we generate 
using Wick's theorem. This diagrammatic notation will have the added ben- 
efit of having a simple physical interpretation which will guide our intuition 
about physical processes. 

Suppose we want to construct a matrix element at n th order in pertur- 
bation theory. We draw a diagram containing n vertices with 4 lines ema- 
nating from each vertex. Each such vertex represents a factor of (c^it/;) 4 . 
The lines emanating from the vertices can be connected. Each such con- 
nection represents a contraction. We will call such a line an internal line. 
The remaining (uncontracted) lines - external lines - represent incoming 
and outgoing phonons. We will adopt the convention that incoming phonon 
lines enter at the left of the diagram while outgoing phonon lines exit at the 
right of the diagram. 

The first contribution which we considered in chapter 3 (3.47) can be 
represented as: 

Given such a diagram - a Feynman diagram - you can immediately 
reconstruct the expression which it represents according to the following 
rules: 

• Assign a directed momentum and energy to each line. For external 
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Figure 4.1: The diagram corresponding to (3.47). 

lines, the momentum is directed into or out of the diagram for, respec- 
tively, incoming and outgoing phonons. 



• For each vertex with momenta, energies (pi, wi), . . . , {pa-,u>a) directed 
into the vertex, write: 



• Imagine labelling the vertices 1,2, ... ,n. Vertex i will be connected 
to vertices ji, ■ ■ ■ ,j m (m < 4) and to external momenta pi, . . . ,PA-m- 
Consider a permutation of these labels. Such a permutation leaves the 
diagram invariant if, for all vertices i, i is still connected to vertices 
ji , . . . , j m (m < 4) and to external momenta pi , . . . , p4- m . If 5* is the 
number of permutations which leave the diagram invariant, we assign 
a factor 1/5 to the diagram. 

• If two vertices are connected by I lines, we assign a factor 1/Z! to the 
diagram. 

You can verify that by applying these rules to figure (4.1) we recover (3.47). 

For the particular interaction we have chosen, we can ignore the trans- 
verse phonons - since they don't interact - and consider only the longitu- 
dinal phonons. If we were to consider a model in which both longitudinal 



— * — * 

• For each external line with momentum k, write \k\. 



• For each internal line with momentum and energy p, to write: 




g (27r) 3 (5(pi + P2 + P3 + Pa) 2tt5(uji +uj 2 + u)^ + oja) 
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Figure 4.2: All connected Feynman diagrams for the theory (3.1) to 0(g 2 ). 
In (a), we have the diagrams of order g. In (b), we have the diagrams of 
order g 2 . 

and transverse phonons interact, our Feynman diagrams would have to have 
internal and external indices corresponding to the vector indices of the fields 
Ui, Uj, etc., and our Feynman rules would have to tell us how to contract or 
route these indices. 

In figure (4.2) we display all of the connected diagrams which appear to 
0(g 2 ) in the theory given by (3.1). In the problem set, you will write down 
expressions for them. 

The Feynman diagram representation for transition amplitudes suggests 
a beautiful visualization of perturbative processes. External lines corre- 
spond to 'real phonons' or simply phonons, while internal lines correspond 
to 'virtual phonons'. For the diagram of figure 4.1, we say that the incoming 
phonons with momenta k±,k2 interact at x±, propagate as virtual phonons 
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with momenta pi,ki + k2 — Pi, and finally interact again at X2, thereby 
scattering into the outgoing phonons with momenta k% , k± . For the first dia- 
gram of figure 4.2b, we say that the incoming phonons with momenta k±, ki 
exchange a pair of virtual phonons, thereby scattering into £3, £4. External 
lines correspond to initial or final states with phonons of momentum, energy 
(p, lo). These satisfy to 2 = (u£) 2 . Such a phonon is said to be 'on-shell'. Vir- 
tual phonons need not be 'on-shell'. Indeed, the phonon propagator diverges 
if a virtual phonon is on-shell, thereby signalling a resonance. 

4.2 Loop Integrals 

Suppose we have a Feynman diagram with E external lines, / internal lines, 
and V vertices. Suppose, further, that this diagram has L loops (e.g. the 
first diagram in figure 4.2 has one loop, while the third, fourth, and fifth have 
two loops. The second has no loops.). Then, let's imagine connecting all of 
the external lines at a single point so that the Feynman diagram defines a 
polyhedron with E + I edges; V + 1 vertices - the extra vertex being the one 
at which the external lines are connected; and L + E faces - with E faces 
formed as a result of connecting the external lines. According to Euler's 
fomula, 

(# faces) + (# vertices) - (# edges) = 2 (4.1) 

or, 

L = I-V + 1 (4.2) 

The number of loops is given by the number of internal lines - i.e. the 
number of propagators, each coming with an integral - minus the number 
of vertices, each coming with momentum and energy-conserving 5-functions, 
plus 1 for the overall 5-functions satisfied by the external momenta, energies. 
In short, there are as many (p,cu) pairs to be integrated over as there are 
loops. A diagram with no loops has as many 5-functions as integrals, so the 
integrals can all be evaluated trivially, and there are no remaining integrals 
to be evaluated. Such a diagram is said to be a tree level diagram. The tree- 
level diagrams are indicated in figure 4.3 These diagrams can be evaluated 
without doing any integrals. Note that most of these are not connected 
diagrams. In order to evaluate a one-loop diagram, we need to do one 
/ dujd 3 p integral; to evaluate a two-loop diagram we need to do two such 
integrals, i.e. f f duj\d?pi duid^pi'i and so on. The expansion in loops is 
actually an expansion in powers of Planck's constant, since, as you will show 
in the problem set each propagator comes with a factor of h and each vertex 
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(C) + . . . 

Figure 4.3: The tree-level Feynman diagrams of the theory (3.1). 
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comes with a factor of 1/h. An L-loop diagram comes with a coefficient of 
h L -\ 

Turning now to the evaluation of multi-loop diagrams, we find the fol- 
lowing trick (due to - you guessed it - Feynman) very useful. When we 
integrate the momenta in closed loops, we often encounter integrals of prod- 
ucts of propagators. These are more easily evaluated if we combine the 
denominators of the propagators using the following formula: 



dx- 



o [ax + 6(1 — x)Y 



1 

ab 



(4.3) 



For the more general case of the product of several propagators, which 
can occur in higher orders of perturbation theory, we will use the following 
formula: 



A a Jo 



-At 



(4.4) 



Using this formula, we can write: 



= TT— f 



POO 

Jo , 



(4.5) 



Here, the integral in brackets is equal to 1. Changing variables from ij to xi 
according to tj = sxj, we have: 



_ r r 1 



fy Oi 7 — 1 —qA 

3 s X j ' 



e~ sA ^ -6(1 - J» 



■3) JO JO 



n,r(« 



j) Jo 



dx\ . . . dx n 5(1 — 



a-i — l 



(>: ; -'> 1 ,) 



(4.6) 



To see why these formulas are useful, consider the evaluation of diagram 
4.1. We have 



g 2 f dw\ d 3 p\ 



\ki + k 2 -pi\ 



2p J 2vr (2vr) 3 , , 2 2 (r , T -V , x ~ v?Pi + iS 

r y ' (ei + e 2 -wi) - vfl ki + k 2 - pi J + id 1 iri 

(4.7) 



4.2. LOOP INTEGRALS 



49 



This can be brought into a more useful form using (4.3) with 

a = (ei + e 2 - ^>i) 2 - vf (%i + k 2 - Pi) + iS 

b = u\- v?p? + i5 (4.8) 

Using (4.3), we can write 

\k\ + k 2 -pi\ 2 \pi\ 2 



(e 1 + e 2 -u; 1 ) 2 -v 2 (k 1 + k 2 ) + iS ^ ~ ^ + i6 

= j dx 
Jo 



|pi| 2 |/ci + k 2 -pi\ 2 



dx 



^(ei + e 2 - ^i) 2 - uj 2 ^! + fc 2 - Pi) 2 + i<5^ x + (uj\ - vfp? + iS) (1 - x) 

\pi\ 2 \ki + k 2 -p\\ 2 



tot 



vfp 2 + x((ei + e 2 ) 2 - fjVi 2 ) - 2xwi (ei + e 2 ) + 2x^Vi ■ (&i + k 2 - p^j + i<5 



If these integrals were from — oo to oo, then we could shift the variables of 
integration without worrying. In condensed matter physics, these integrals 
are always cutoff, so we must be a little more careful; in our phonon theory, 
the momentum cutoff, A, is the inverse lattice spacing and the frequency 
cutoff, A w , is the Debye energy. However, so long as the external momenta 
and energies are much smaller than the cutoffs, i.e. k{ <§C A, &i A w , 
we can shift the variables of integration and neglect the effect of this shift 
on the range of integration. Thus, we proceed by changing the variables 
of integration to u = uj\ — x(e\ + e 2 ), q = p\ — x(k\ + k 2 ). Writing a = 
x(l — x)((ei + e 2 ) 2 — vf(k\ + k 2 ) 2 ) we can write the loop integral as: 

duj d 3 q q 4 | f duj d 3 q q 2 (ki + k 2 ) 2 (x 2 + (1 - xf + §x(l - x)) 



2^(2vr)3 ( w 2_„2^2 + )2 J 2 ^(2vr)3 (u 2 - vfq 2 + a) 

du d 3 q x 2 (l - x) 2 (ki + k 
2^(2^)3 (u2- v tf2 + a ) 



fdw d?q x 2 (l-x) 2 {k 1+ k 2 f 
J 2^(2vr)3 (tf-tfffi + a) 2 



Integrals of this form are often divergent. If we forget about the momen- 
tum and frequency cutoffs, then these integrals are ultraviolet divergent. If 
we're careful and we remember that these integrals are cutoff in the ultravi- 
olet, then we will get finite (albeit cutoff-dependent) answers. On the other 
hand, these integrals are infrared divergent if a = - i.e. if e«, ki vanish. 
This is a real, physical effect: phonon Green functions do diverge when the 
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momenta, energies tend to zero. We will study the power-law forms of these 
divergences when we turn to the renormalization group in chapter 11. 

It will sometimes be important to distinguish between the frequency 
cutoff, A w , and the momentum cutoff, A. Often, however, the distinction is 
unimportant and we can assume that there is a single cutoff, A^ = viA. In 
such a case, we can simplify (4.9) by using analytic continuation. 

For either sign of \q\ 2 — a, the poles in u> are in the second and fourth 
quadrants. Hence, the contour of integration can be harmlessly rotated in 
an anti-counter-clockwise direction from the real co axis to the imaginary to 
axis. If we write q^ = —iu> and q 2 = q\ + vfq 2 , then (4.9) is equal to 



(2vr)4 (_ g 2 + a f + v f J (2vr)4 (_ g 2 + a f 



+ 



d 4 q x 2 {l-x) 2 {k 1 +k 2 f 



(2tt)< 



{-q 2 +aY 



(4.10) 



or 



ifhM + 4f7^i + fc 2 ) 2 (x 2 + (i 

v i v i ^ 



-x(l - x) I 2 ,i(a 



+ \ (x\l - x) 2 (h + k 2 ) A ) I 2fi (a) (4.11) 



where the integrals which we need to study are: 

d 4 q q 2m 



(2vr) 4 {-q 2 +a) n 



or, setting z = q 2 , and V(S 3 ) = 2tt 2 , 

dz 



In,m(p>) 



A 2 z m+l 



(-ir 

(n - 1)! da™- 1 \ 16tt 2 J (-z + a) 



{-z + a) 
d"- 1 ( 



z m+1 dz 



(-1)" d 



n-1 



(n-1)! da n ~ x \ 16tt 2 



J —a 



(u + a) m+1 du 



u 



n-1)! da- 1 \ k U 

' \k=0 v 7 



m+l—k 



16tt 2 (n 

i (-l) n d 
16-7T 2 (n — 1)! da 



n—l 



-L^lnu+Vl m+I 
n— 1 I / j 

k=0 



(4.12) 



A 2 — a 



k „m+l— k 



k 



u a 
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fc=0 

Hence, we finally obtain: 

i 



16vr 2 (n-l)! da n - lK \ -a J 

m+1 1 , , -i \ 

m ^> m+1 -* (A 2 -a) fc -(-a)* ) (4.13) 

t — n \ / 



r jf'dxt^ (^ 2 (1 - z) 2 (£i + fe 2 ) 4 ) {-^A 4 - 3A 2 x(l - x^vfih + k 2 f - (e 1 + e 2 ) 2 ) 

-(x(l - x)(^ 2 (fc! + £ 2 ) 2 - ( €l + e 2 ) 2 )) 2 

+3fx(l-x)(v 2 (fc 1 + k 2 f- (ei + e 2 ) 2 ) N ) 2 ln f ^ )} 

^ ' ^ U(! - ^)K 2 ( fc i + M 2 - (ei + 6 2 ) 2 ) J 

+~(h + k 2 f (x 2 + (1 - x) 2 + U(l - x)) {A 2 + x{\ - x^vfih + k 2 ) 2 - (ei + 6 2 ) 2 ) 



^(fci + fc 2 ) 2 (x 2 + (l-x) 2 + ^ ^W*2 , „,n , Ctf , ^ 
-x(l - x)(v 2 (ki + h) 2 - (e 1 + 6 2 ) 2 ) In [ — „, r A * — ) } 



x(l-x)(t; 2 (A: 1 + A: 2 )2-( ei + e2 )2) 
A 2 



+ fc 2 ) 2 -(ei + e 2 ) 2 ) 



-.ji 

(4.14) 



To summarize, we evaluate a Feynman diagram by the following steps: 

• Use the Feynman rules to obtain a loop integral. 

• Combine the denominators using Feynman's trick. 

• Shift the variables of integration to make the denominator invariant 
under uj — > u>, p — > —p. 

• Analytically continue to imaginary frequencies. 

• Use rotational invariance to reduce the integral to an integral over a 
single variable for each u>,p. 

4.3 Green Functions 

In the preceding discussion, we have implicitly assumed that external phonon 
lines are 'on shell', i.e. they satisfy uj 2 = (lo 1 p ) 2 . It does, however, make sense 
to relax this requirement and allow even the external phonons to be "off- 
shell". One reason is that we may want to define a Feynman diagram - or 
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Figure 4.4: The definition of the 4 point Green function 



a set of diagrams - which can be part of a larger diagram. In such a case, 
the lines which enter this part might not be on-shell. 

Consider the diagram of figure 4.4a. The shaded circle represents all 
possible diagrams with 4 external legs. The first few are shown in figure 
4.4b. We will call such an object 

G{pi,P2,P3,P4> (4.15) 

(We will usep as a shorthand for p, oj.) G(pi,p2,P3,P4) is defined to include 
the momentum conserving 5 functions, 

(2ir) 3 5(pi +P2+P3+ Pa) 2tt5(uji + uj 2 + ^3 + <^a) 
and a propagator 

\Pi 



~ V lPi + *<* 

on each external leg. 

We can define similar objects - called Green functions - for any number 
of external legs: 

G( Pl ,p 2 ,...,p n ) (4.16) 
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It is given by the sum of all diagrams with n external legs with (possibly 
off-shell) momenta and energies p±,p 2 , ■ ■ ■ ,p n with a propagator assigned to 
each external leg. We can Fourier transform the Green function to obtain 
the real-space n-point Green function, G(x\, . . . , x n ). 

While the notation G(x\, . . . , x n ) is generically used for Green functions, 
the phonon two-point Green function is often denoted D(xi,x 2 ). However, 
we will reserve this notation for the two-point function of the other phonon 
field ipi, which is more natural in some contexts. 

The name 'Green function' is due to the fact that when the interaction 
is turned off, i.e. g = 0, the two-point Green function is a Green function of 
the differential operator 

pSikdf - (/i + A) did k - nS ik djdj (4.17) 

This follows since the two-point Green function is just the derivative of the 
propagator: 

G(xi,x 2 ) = didj (T (u i (x 1 ,t 1 )u j (x2,t 2 ))) 

~ 13 pj (27T)3 27T UJ 2_ { ^Y+iT ' 

It therefore satisfies 

(pS ik dt - {n + A) did k - uSikdjdj) G(x 1 ,x 2 ) = i5 (f i - x 2 ) 5 (h - t 2 ) 

(4.19) 

as you showed in the first problem set. 

4.4 The Generating Functional 

Let's modify our Hamiltonian by adding a 'source term', 

H^H + j d 3 xj(x,t)d k u k (x,t) (4.20) 

The source, j, is some arbitrary, prescribed function. We can think ofj(x, t) 
as a knob which we can turn in order to set up compressional waves in the 
solid. By measuring the system at (x',t'), we can study the propagation of 
sound waves. 

Our interaction Hamiltonian is now Hj + f o9x j(x)d k u k (x), so our Feyn- 
man rules must be expanded to include a new vertex - which we will call 
a 'source vertex' -with only one line emerging from it. If this line has mo- 
mentum, energy p, u>, we assign —ij(p,u) to it (j is the Fourier transform 
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Figure 4.5: Some vacuum-to-vacuum diagrams in the presence of an external 
source. 

of j). Let us now look at the vacuum-to- vacuum amplitude, which we will 
call Z\j}: 

z[j] = (o|r{ e -^ H I+ jd k u k ^ ^ 2V) 

This is given by the sum of all diagrams with no external legs. Several of 
these are shown in figure (4.5). We have denoted the new vertex by a large 
dot with a j next to it. To make life easy, let us shift the zero of energy by 
adding a constant to the Hamiltonian, H — ► H + Eq and choose Eq so that: 

Z[0] = 1 (4.22) 

Hence, the sum of all of the diagrams with no source vertices is 1. Consider 
a diagram with n source vertices. It will have an amplitude proportional 
to j(pi, ui) . . . j(p n , Each j(pi,(Vi) creates a phonon with momentum 
Pi,coi. These n phonons enter the diagram along the external legs of the 
Green function G(pi, . . . ,p n ). We then have to integrate over all of the p^s 
with a factor of 1/n! to avoid overcounting due to permutations of the p^s. 
Hence, 

Z \j] = 1 + /2 j - / dZ ^ 1 duJl '-- d3 P n duJn it? 1 ) ■ ■ ■ j(Pn) G(Pl ,---,Pn) 

n=l 

(4.23) 
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We can Fourier transform this expression into real space: 
00 (—i) n f 

Z \j] = l + Z~L ~ / d 3 x 1 dti...d 3 x n dt n j(x 1 )...j(x n )G(x 1 ,...,x n ) 

n=l n - J 

(4.24) 

We can understand the Green function in another way by considering 
the Hamiltonian with a source term, 

H -> H + j<9 fc u fc 

= f + h' + i^fcUfe 

= (flo + fr')+^«fc ( 4 - 2 5) 

We can now treat Hq + H' as our 'free' Hamiltonian and jdkUk as our 
interaction Hamiltonian. Since Hq + H' is not actually free, we can't use 
Wick's theorem, but we we can still use Dyson's formula. The 'interaction' 
representation for this 'free' Hamiltonian is actually what we would call the 
Heisenberg representation for j = 0, so we will put an H superscript on all 
fields. Using Dyson's formula, we can express Z[j] as: 

Z[j] = (0\T j e -*/d 3 zdi j(2,t)d fc Mf (:r,t) j |q^ 

= 1 + — p- / d 3 xidh...d 3 x n dt n j(x 1 )...j(x n ){0\T(d k u%(x l )...d k u%(x n )) |0> 

n=l n - J 

(4.26) 

Comparing this with our earlier expression for Z[j], we see that the 
Green function is given by: 

S n Z\i] 

G(x 1 ,x 2 ,...,x n ) = (0\T(d k u^(x 1 )...d k u^(x n ))\0)- 



Sj(x 1 )...5j(x n ) 
(4.27) 

In other words, the Green functions are the vacuum expectation values of the 
T-ordered product of a string of (Heisenberg picture) fields. These vacuum 
expectation values are the coefficients of the Taylor expansion of the vacuum- 
to-vacuum transition amplitude in the presence of an external source field. 
While our earlier definition - as a sum of Feynman diagrams - is convenient 
for perturbative calculation, the present definition as a vacuum expectation 
value is far more general since it is non-perturbative and can be compared 
with experiments. These vacuum expectation values are also called time- 
ordered correlation functions. 
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4.5 Connected Diagrams 

There is a very useful theorem which states that 

Z[j] = e w W (4.28) 

where Z[j] is the sum of vacuum-to- vacuum Feynman diagrams we defined 
above and W[j] is the sum of connected vacuum-to-vacuum diagrams. To 
prove this theorem, observe that a given diagram which contributes to Z[j] 
will, in general, be made up of several different connected diagrams and 
it will factor into the contributions from each of them. We can assemble 
the set of all vacuum-to- vacuum diagrams by putting together n\ connected 
diagrams of type 1, n<i connected diagrams of type 2,. . . , n r connected dia- 
grams of type r, etc. The contribution of such a diagram to Z[j] will be the 
product of the contributions, C r , of its connected components: 

oo oo oo f-fn 

ni=0ri2=0 r=l 

The n r \ in the denominator is the symmetry factor resulting from the per- 
mutations of the n r identical connected components of type r. Commuting 
the sums and the product: 

oo / oo ™ \ 

r=l W=0 / 

oo 

= IF' 

r=l 

= e w ® (4.30) 

This theorem - sometimes called the linked cluster theorem - will be 
particularly useful when we construct a diagrammatic expansion for the 
partition function, Z = tr{e~^ H ). In order to compute the free energy, 
F = —TlnZ, we need only compute the connected diagrams. 

The Taylor expansion of W[j] is: 

°° ( '\TL I* 

W[j} = W[U]+Y)-^r J d 3 x 1 dt 1 ...d 3 x n dt n j(x 1 )...j(x n )G c (x 1 ,...,x n ) 

71=1 

(4.31) 
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where the G c 's are connected Green functions. The two-point connected 
Green function is given by: 

G c (x!,x 2 ) = <0|r(^« fc (xi)Sfc«fc(x 2 ))|0) - {0\d k u k ( Xl )\0) (0\d k u k (x 2 )\0) 
= (0\T ((d k u k ( Xl ) - (d kUk ( Xl ))) (d k u k (x 2 )) - {d k u k {x 2 )))(\6$2) 

This correlation function is often more useful since it measures fluctuations 
around mean values. 



4.6 Spectral Representation of the Two-Point Green 
function 

The spectral representation of the two-point Green function has the advan- 
tage of being intuitive yet well-suited for rigorous statements. It is obtained 
by inserting a complete set of states into 

{T(d iUi (x,t) Uj (0,0))) = 6(t){diUi(x,t)djUj$,G)) + e(-t)(d j u j (0,0)d i u i {x,t)) 
= e(t)^2{0\ d iUi (x, t) d jUj (0, 0) | 0) 

i 

+ 0(-*)X)(O |^.(0,0)| i) (i\dm@i,h)\0) (4.33) 

i 

By translational invariance, 

(0 \d iUi (x,t)\ i) = e^-^ (0 1^(0,0)1 i) (4.34) 

where pi and u>i are the momentum and energy of the state | i) . 
Hence, we can write the Green function as 

(T(d iUi (x, t) Uj (0,0))) = J>|c^(0,0)| 0)| 2 (9{ty®-*- u & + ^(-t)e-^--^)) 

d 3 PdE \d jUj (0,0)\ 0}\ 2 5{P -pl)5(uji - E) 

x {e{t)e^ p - s - E ^ + e{-t)e-^ p - s - Et ^ 

= j d 3 PdE\P\ 2 B(P,E) (e{t)e i{p - s - Et + e{-t)e-^ P s - Et ^ 

(4.35) 

Here, we have introduced the spectral function, B(P, E), so that \P\ 2 B(P, E) 
is given by the quantity in brackets. In order to take the Fourier transform, 
we have to add ids to make the t integral convergent, so that we obtain: 

G{x,w) = j dt j d 3 PdE\P\ 2 B(P,E) (o( t y(P-s+("-E+it)t) + 0(_ t ) e -i(j*.5f-( w +£-i 
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j dt j d 3 PdE\P\ 2 B(P,E) (e{t)e i{w - E)t - &t e ip -* + 9{-t)e i{u+E)t+St e~ iP ^ 

! 



<eP dEt \ P m p -E) (-f^j - (4.30) 



or 



G ^^[ dEtlpf (^^ ItML) (4 . 37 ) 

J \uj — E + id oj + E — id J 

For a parity-invariant system, B(p,E) = B(—p,E), so 

G&u) = JdEiW* ™*®^ (4.38) 



From its definition, B(p,E) is non-negative. If the phonons are non- 
interacting, B(p, E) = 5(E 2 —Up), and we recover the free-phonon two-point 
function. 

We can split the sum over i into the vacuum state, the one-phonon states, 
and all other states. Now, let us assume that 

<0|ui(£i,ti)|0) = (4.39) 

If it didn't, this would be the statement that there is some kind of static dis- 
tortion present in the ground state. We could shift Ui(x\,ti) by Ui(x±, ti) — ► 
Ui(x\,t\) — (0|uj(xi, ii)|0) and we would have the above for our new dis- 
placement field. 

Consider a one-phonon state of momentum p. Then, we will write: 

|p| 2 Z = |(0|d^(0,0)|p)| 2 (4.40) 

Rotational and Galilean invariance imply that the left-hand-side is indepen- 
dent of the direction of p. 

Then the spectral function can be broken into a piece, carrying weight Z, 
which looks like a non-interacting phonon, and the remaining 'incoherent' 
weight: 

B{p, E) = Z5{E 2 - W J) + B inc (p, E) (4.41) 

The phonon propagates as a free phonon with probability Z and as a multi- 
phonon state of energy E with probability B UiC (p, E). 



4.7. THE SELF-ENERGY AND IRREDUCIBLE VERTEX 



59 




Figure 4.6: The relation between II and G. 

4.7 The Self-Energy and Irreducible Vertex 

The two-point Green function G(p\,p 2 ) is given by the diagrams in figure 
(??). To zeroth order in g, it is simply the free phonon propagator. There 
is an 0(g) correction given by the diagram of figure (4.2) which leads to 



G( Pl , P2 ) = (2tt) 6 6(p 1 + p 2 ) 2vr<5(a;i + u 2 ) ( \ptf — 2 

+ffei 2 ., L.J 1 , ..L , r, + °(<f-)) 



(4.42) 



For the two-point Green function, we can do better without doing much 
more work. Let us define the one-particle irreducible, or 1PI n-point Green 
function as the sum of all the Feynman graphs contributing to the n-point 
Green function which cannot be made disconnected by cutting a single in- 
ternal line (this is a subset of the set of connected diagrams). For the 1PI 
n-point function, we do not include propagators on the external legs. The 
1PI two-point Green function is given by Il(p,uj)/p 2 ; U(p,uj) is called the 
self-energy because the two-point Green function can be expressed in terms 
of it according to the graphical relation of figure 4.6. Summing this geomet- 
rical series, we have: 

G(pi,p 2 ) = (2nf 5 (p! + p 2 ) 2715(10! + u 2 ) \p[\ 



ujj - vfpf - II(pi,cJi) + iS 
(4.43) 
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Figure 4.7: The relation between the regular and 1PI four point Green 
functions. 

From our calculation above, we see that the self-energy is given by: 

n <^» = I ^ / (0 S " f ^WTVs + °^ (4U4) 

In the problem set, you will show that lm{U(p,uj)} is related to the phonon 

lifetime. 

The coherent weight in the phonon spectral function, Z, is given by: 

Z' 1 = 1 - (i Re ( n )) ( 4 - 45 ) 

V U / U!=Vp 

We can also define a 1PI 4-point Green function, T(pi,p2,ps,P4). The full 
Green function G(pi,p2,P3,P4,) can be expressed in terms of T{pi,p2,P3,p^) 
and the two-point Green function G(pi,p2) according to the graphical rela- 
tion of figure 4.7. 



CHAPTER 5 



Imaginary-Time Formalism 



5.1 Finite- Temperature Imaginary- Time Green Func- 
tions 

In the previous chapter, we found that the mathematical trick of analytically 
continuing to imaginary frequencies, co — > iuj, facilitated the calculation of 
the integrals arising from Feynman diagrams. As we will demonstrate in 
this chapter, it is extremely convenient to work with imaginary-time from 
the outset. Such an imaginary-time formalism will have the advantage of 
having a natural extension to arbitrary temperature. It can also, in many 
cases, serve as a preliminary step in the calculation of retarded correlation 
functions which - as we will discuss in the next chapter - are the quantities 
most closely related to physical measurements. 



We make the analytic continuation it — > r and define the following object 
for < r < (3: 



Q(x — x,t — t') 



6{t - T r )Tr {e-^ e rH d k u k {x)e- rH e T ' H djU^y^' H } 
+6{t' - r)Tr {e- pH e r ' H d jUj {x')e- T ' H e rH 8 k u k {x )e" rH } 
0(t - t')Tt \e- pH d k u k {x,T)d jUj {x! ,t')} 
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CHAPTER 5. IMAGINARY-TIME FORMALISM 



We have passed from the Schrodinger representation in the first line to an 
imaginary-time Heisenberg representation in the third line. In the final line, 
we have defined the imaginary-time-ordering symbol, T T , by analogy with 
the real-time symbol, T: operators are arranged from right to left in order 
of increasing r. In a similar way, we can define the imaginary-time-ordered 
product of strings of fields. If T\ > T2 > . . . > T n , then 

(T T (Oi . . . O n )) = Tr [e-P H 1 . . . O n ] (5.2) 

Rather than take the expectation value in the gound state, we are averaging 
the expectation value over all energy eigenstates, |n), weighted by e~^ En . If 
we had used it rather than r, we would have the finite-temperature time- 
ordered Green function. By working with r, we will construct a Green 
function from which the retarded Green function can be constructed by 
analytic continuation. We will also exploit the formal analogy between the 
time-evolution operator e~ ltH — > e~ rH and the Boltzmann weight, e~@ H . 
In analogy wth the real-time case, we write 

U(r 2 ,r 1 ) = e-^-^ H (5.3) 

We will add a source to the Hamiltonian and view the partition func- 
tion, Z[j] = Tr{e~@ H } = Tr{U((3,0)}, as the generating functional for 
imaginary-time Green functions. In the perturbative expansion of U(/3, 0), 
we will only encounter fields with imaginary-time arguments in the interval 
[O,0\. 

There is a further condition which follows from the cyclic property of the 
trace. Since < r, r 1 < /3, it follows that — (3 < r — r' < [3. Now suppose 
that t < t'. Then, 

Q{t -t' < 0) = Tr {e-^ e T ' H d j u j (x' )e~ T ' H e rH d k u k {x)e~ TH } 

= Tr {e TH d k u k {x)e~ TH e~^ H e T ' H d jUj {x' )e~ T ' H } 

= Tr [e~P H e? H e TH d k u k (x )e~ TH e^ H e T ' H d j u j (x' )e^' H ] 
= g(r-r' + f3) (5.4) 

The first equality follows from the cyclic property of the trace. The final 
equality follows from the fact that r — r' + (3 > 0. 

As a result of periodicity in imaginary-time, we can take the Fourier 
transform over the interval [0,/?]: 

G{iuj n )= / dre w - T Q{r) (5.5) 
J 
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where the Matsubara frequencies uj n , are given by: 

2nir 



UJ r , 



Inverting the Fourier transform, we have: 



Q{r) = -^y^ n )e-^ 



(5.6) 



(5.7) 



In the absence of interactions, we can evaluate the imaginary-time two- 
point Green function directly. Using the Planck distribution, 



1 



and substituting the mode expansion of u k , we have: 

G{x,t) = 0(T)Tr{e-P Ho d k u k (x, r)d jUj (0,0)} 

+6(-T)Tr{e- /3Ha d j u 3 (0,0)d k u k (x,T)} 



(5.8) 



/ 



d 3 k 



[9(t) ((n B (u k ) + l) e *™- + n B (wk)e~ < *" if+WfcT ) 
+9{-t) (n B (u k )e iU - 3 -^ T + (n B (u k ) + l) e -**+^}?) 



We can now compute the Fourier representation of the Green function: 

g{p,iuj n ) = J Sx^ s J f ^ dre iulnT Q{x,T) 

_ \p\ 2 f(nB(u P ) + l)(e-^'-l) , n B (u p )(e^-l) 



-1 1 

+ 



2u> k 

2uo k \ iu n — uj p ' iu) n + uj p 



+ 



lUJ n + UJ p 



(5.10) 



In real-space, this is: 
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As we can see from the above derivation of the Green function, in the 
imaginary-time formalism, we have the decaying exponential e~ WT rather 
than the oscillatory e* w *, so we can dispense with the i<5's which are needed 
to ensure convergence in the real-time formulation. Indeed, from a mathe- 
matical point of view, imaginary-time is simpler than real-time; sadly, nature 
forces us to live in real-time. 



5.2 Perturbation Theory in Imaginary Time 

Following our real-time development, we consider a Hamiltonian of the form 
H = Hq + i?mt, and go to the imaginary-time interaction representation. In 
this representation, 

U(n,n) = T T { e - (5.12) 

Hence, the imaginary-time Green function , which takes the Schrodinger 
picture form: 

g(x,r- t') = 9{t - r')Tr {e - ^ e rli d k u k (x)e~ TH e T ' H d jUj {0)e- T ' H } 

+9{t' - r)Tr {e - ^ e T ' H d jUj (0) e~ T ' H e rH d k u k (x )e~{^) 

can be written in the interaction picture as: 

Q(x,t- t') = 6{t - T ')Tr{e-P Ho U(P, 0) U~\t, 0)d k u k (x, r) U(t, 0) x 

U-\T / ,0)d j U j (0,T , )U{T , ,0)} 

+9{t' - T)Tr{e-P Ho U{(3, 0) U~ l (r', 0)d jUj (0, r') U{t', 0) x 
U-\t, 0)d k u k (x, t)U(t,0)} 
= 9{t - T ')Tr{e-P Ho U(P, r) d k u k (x, r) U(t, r') d jUj (0, r') U(t', 0)} 

+9(t' - r)Tr {e'^ H " U(J3, r') d jUj (0) U(r, r')d k u k (x)U(r, (#}l4) 

or, simply, as 

Q(x, t-t')= Tr {e-^°T T (U(P, 0) d k u k (x, r) d jUj (0, r')) } 

= <T r (U(p,0)d k u k (x,r)d jUj (0,T'))) (5.15) 

As we noted earlier, only imaginary times r 6 [0,/?] appear. 

To evaluate this perturbatively, we expand U((3,0), as in the real-time 
case: 

& 



J2L(_iy» rP rP 
fo n! Jo 
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(T T {d k u k (x, t) d jUj (0, t') H int ( Tl ) . . . i? int (r n ))>(5.16) 



We can show that Wick's theorem holds for the imaginary-time-ordered 
product and use it to evaluate the above expectation values. Following our 
real-time development in this way, we can use Feynman diagrams to evaluate 
the perturbation series. The differences are that in imaginary-time, 

• To each line, we associate a momentum, p and a Matsubara frequency, 

LO n . 

• For each external line with momentum p, write \p\ . 

• The propagator assigned to each internal line is: 



• For each vertex with momenta, Matsubara frequencies (pi, u) ni ), . . . , (p±, 
directed into the vertex, we write 



• Imagine labelling the vertices 1,2, ... ,n. Vertex i will be connected 
to vertices ji, ■ ■ ■ ,j m (m < 4) and to external momenta pi, ■ ■ ■ ,p±- m . 
Consider a permutation of these labels. Such a permutation leaves the 
diagram invariant if, for all vertices i, i is still connected to vertices 
ji j • • • > 3m ( m < 4) and to external momenta p\ , . . . , p^-m ■ If 5* is the 
number of permutations which leave the diagram invariant, we assign 
a factor 1/S to the diagram. 

• If two vertices are connected by I lines, we assign a factor l/l\ to the 
diagram. 

Using our result on connected Feynman diagrams from chapter 5, we see 
that the free energy, F, is given by 




g (2tt) 3 5(pi +P2+P3+ Pi) (3 <5 ni+n2+n3+n4 ,o 



-f5F = In [Tr {£/(/?,())}] 




All connected diagrams with no external legs (5.17) 
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5.3 Analytic Continuation to Real-Time Green Func- 
tions 

In spite of their many charms, imaginary-time Green functions cannot be 
directly measured in experiments. Hence, we must contemplate real-time 
Green functions. In fact, it is useful to consider complex variable, 

and to analyze the properties of Q{t) as r varies through the complex plane. 
When r lies on the real axis, we have the imaginary-time Green function: 

G(x-x',t- t') = 6{t - r')Tr \e~ pH e~ rH d k u k (x)e rH e~ T ' H d jUj {x')e T ' H } 

+9(t' - r)Tr {e~ pH e~ T ' H d jUj {x')e T ' H e~ rH d k u k (x)e rH } 

(5.18) 

When r is on the imaginary axis, r = it, we have the real-time Green 
function: 

G{x-x',t- t') = 9(t - t')Tr {e-^ e~ im d k u k (x)e itH e'*' H djUj^y* H } 

+9{t' - t)Tr {e~P H e~ u ' H d jUj (x')e it,H e~ itH d k u k (x)e itH ] 

(5.19) 

For arbitrary complex r, Q{t) interpolates between these two. G{t) is not, 
however, an analytic function over the entire complex plane, as we can see 
from its spectral representation. We follow our earlier derivation of the 
spectral representation for the T = real-time ordered Green function. The 
principal difference is that we now have e~@ En \n) rather than |0). Hence, 
by inserting a complete set of intermediate states, \m)(m\, we have, in lieu 
of (4.35), 

G(x, t)= l Spdcu^ip- Pm + Pn)S(u - u nm ) (d^e^-^ e-? E " 

n,m 

+0(_ r )) e -#-***r e -f3E m ^ m 1^.(0,0)1 n$]20) 

The Fourier transform, 

g(p, %u)j) = J d *xJ^ dT T ) ei0JjT ( 5 - 21 ) 

is given by: 

g{p,iUj)= j dE[J2 (e-^-e-^) \(m \d iUi (0, 0)| n)| 2 

n,m 
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x S{p -p m + p n )S(E -E m + E n )\ - - (5.22) 

hi — ILOj 

Writing 

p 2 B(p,E) = (e _/3En " e _/3£m ) |(m|^(0,0)| n)\ 2 5(p-p m +p n ) 5(E-E„ 

n,m 

(5.23) 

we have the spectral representation of Q: 

e{R ,^ = r dE ^mB ( 5. 2 4) 

J-oo E - tUJj 

As usual, the spectral function B(p,E) is real and positive. 

Q is not analytic as a result of the singularities in (11.53). Hence, it does 
not satisfy the Kramers-Kronig relations. However, the functions 



/_ 



^Ep^L (M6) 
E - uj ± id 



are analytic functions of uj in the lower and upper-half planes respectively. 
Consequently, they do satisfy the Kramers-Kronig relations. As you will 
show in the problem set, these are the advanced and retarded correlation 
functions defined in the next section: 

G iet (M=l dE p2mE) 



= J 



E-LO-iS 

oo m 2 



G adv (p,c)=/ dE I m f\ (5.26) 

Note that the spectral function is the difference between the retarded 
and advanced correlation functions. 

G iet {p, uj) - G a dv(p, tu) = 2-Kip 2 B(p, uj) (5.27) 

5.4 Retarded and Advanced Correlation Functions 

In the previous chapter, we dealt with the time-ordered two-point correlation 
function, 

G(xi,ti;x 2 ,t 2 )= 6{ti - t 2 ) (diUi(xi,ti)djUj(x2,t2)) 

+ 6{t 2 - h) (djUjix^t^diUiixuh)) (5.28) 
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In this chapter, we have introduced the imaginary-time two-point correlation 
function: 

g(x-x',r- t') = 6(t - T r )Tr {e-P H d k u k (x,T)d jUj (x" ,t')} 
+ 6{t' - T)Tr {e-^dju^x' ,T')d k u k (x,T)} 

(5.29) 

To this family of Green functions, we have now added the retarded and 
advanced correlation function. As we will see in the next chapter, the re- 
tarded correlation function is often more useful for comparison with ex- 
periments. At zero temperature, the retarded and advanced correlation 
functions are given by: 

G ret (xi,t 1 ;x2,t 2 ) = 9(ti - t 2 ) (Q\[diUi(x 1 ,t 1 ),d j Uj(x2,t 2 )]\0) 
G , a dv(xi,ti;x2,t 2 ) = 0(t 2 - h) (0\[d j u j (x2,t 2 ),diUi(x 1 ,t 1 )}\0} (5.30) 

At finite temperature, these are generalized to: 

G ic t(xi,t 1 ]x 2 ,t 2 ) = Q(h - t 2 )Tr [diUi{xiM),djUj{x 2 ,t 2 )]\ 
Gadv(xi,ti;x2,t 2 ) = 0(t 2 - *i)Tr {e"^ [d jUj {x 2 , t 2 ), c^(xi, h)]fy.31) 

For free phonons, the zero-temperature advanced and retarded correla- 
tion functions can be obtained by choosing the correct id prescription for 
the poles: 

G ret (x 1 ,h;x 2 ,t 2 )= I >j^e i fr(*i-*i)-"(«i-« 2 )) l f (5.32) 

j (uj + toy — vfp z 

/. 2 
d 3 pdue^-^-^-^ ^ (5.33) 

For interacting phonons, the situation is not so simple. From (11.53), we 
see that for ito n in the upper-half-plane, we can obtain Q from G re t by taking 
to — > iu n . From (11.53), we see that Q(—iuj n ) = Q*(iuj n ), from which we can 
obtain Q for iuj n in the lower-half-plane. In other words, we can always 
obtain Q from G re t- What we would like to do, however, is obtain G re t from 
Q. This analytic continuation from the Matsubara frequencies iuj n to the 
entire upper-half-plane can often be done by simply taking iuj n — > to + iS. In 
the specific examples which we will look at, this procedure works. However, 
there is no general theorem stating that this can always be done. 

In the next chapter, we will see why retarded correlation functions are 
intimately related to experimental measurements. 
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5.5 Evaluating Matsubara Sums 

We can use contour integration and the fact that the poles of nfi(w) are 
precisely the Matsubara frequencies, u) n = 2n7r//3, to convert sums over 
Matsubara frequencies into integrals. As we will see, it is natural to rewrite 
these integrals in terms of advanced and retarded Green functions. 




Figure 5.1: The contour of integration in (5.35). 
Consider the sum 

-^2g(in m -iu n ,p-q)g(iuj n ,q) (5.34) 

' n 

This sum is equal to the following contour integral (see figure 5.1) since 
the integral avoids the singularities of the Green functions; consequently, it 
picks up only the poles of kb(w), thereby leading to the Matsubara sum. 

j)^—n B (uj)g(in m -uj,p- q)G{uj,q) = -^£(i^ m - iuJ n ,p- q)G(iuj n ,q) 

(5.35) 

The singularities of the Green functions occur when to or i£l m — to are real, 
as may be seen from the spectral representation (11.53). The only non- 
vanishing segments of the contour integral are those which run on either 



70 



CHAPTER 5. IMAGINARY-TIME FORMALISM 



side of the lines to = E (the first term on the right-hand-side below) or 
oj = i£l m — E (the second term) where E is real: 

r Af. i poo 

<b — n B (u) Q(u + iVL m ) g(oj) = — / dEn B (E) g{iQ m - E) {g{E + iS) - g(E 

+— / dE n B {iVt m - E) (g(E + i5)-g(E- iS)) g(in m 

Note the reverse limits of integration in the second integral; they arise from 
the fact that E and u are oppositely directed. 

If we assume that the analytic continuation is straightforward (as it often 
is), then we can use (11.56) to write this as: 

r doj 1°° 

^—n B (Lo)g(uj + in m )g(Lj)= J dEn B (E) g(iQ, m — E,p — q)q 2 B(E, q) 

dEn B (iQ rn - E) g(iQ m -E,q)(p- q) 2 B{E,p- q) 

oo 



/oo 
-oo 



Since n B (iQ m — E) = —(1 + n B (E)) for Matsubara frequencies iQ n , we 
finally have: 

1 f°° 

-Y J G{iUn + in m )g{iu Jn )= / dEn B (E)G(in m - E,p-q)q 2 B{E,q) 

P n J-oo 

/oo 
dE (n B (E) + l) g(in m -E,q)(p-q) 2 B(E,p(5$p) 
-oo 

If we also continue ift m — > + id, then we have: 
1 f°° 

-^g(itt m -iu n ,p- q)g(iu n ,q) = / dE n B (E) G TCt (Q - E,p - q) q 2 B(E, q) 

P n J-oo 

/oo 
dE (n B (E) + l) G TCt (tt m ~ E,q) (p - q) 2 B(E,p- q) 
-oo 

(5.37) 

It is important that we did the analytic continuation of the external fre- 
quency, 0, at the end of the calculation, rather than during some intermedi- 
ate step. This type of contour integral trick can be used rather generally to 
bring Matsubara sums to a convenient form, as you will see in the problem 
set. 
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5.6 The Schwinger-Keldysh Contour 

The formalism which we have thus far constructed was designed to determine 
the transition amplitudes from some given intial state at t = — oo to some 
final state at t = oo. In the previous chapter, we were able to relate these 
amplitudes to the amplitude for a system to remain in its vacuum state in 
the presence of an external source j(x, t) and, hence, to correlation functions 
in the vacuum state. We may, however, wish to consider a situation in which 
the system is in a given intial state - say the vacuum state or the state at 
thermal equilibrium at inverse temperature (3 - but we make no assumptions 
about the final state. We may then wish to compute the correlation functions 
of some observable such as diUi(x,t) at time t. 

In order to do this, we imagine evolving the system from some initial 
state \i) at t = — oo to t = oo and then back to t = —oo. The evolution 
operator for such a process is: 

f/(oo,-oo)[/(oo,-oo) (5.38) 

Clearly, this is simply equal to 1 and 2 

(i\U(-oo, oo) f/(oo, -oo)|*> = 1 (5.39) 

Suppose, however, that we switch on a source field, j(x,t), only during the 
forward propagation in time. Then 

(i\U(-oo, oo) Uj (oo, -oo)\i) / 1 (5.40) 

and, by differentiating with respect to j(x,t), we can obtain correlation 
functions. 

If we wish to work with a system at zero-temperature which is in its 
ground state at t = — oo, then we define the generating functional: 

Z[j] = (0|C/(-oo,oo)C/j(oo,-oo)|0) (5.41) 

At finite temperature, we have 

Z[j] = Tr {e~ /3 ^C/(-oo, oo) Uj{oo, -oo)} (5.42) 

or 

Z[j] =Tr{U(-oo-i/3,-oo)U(-oo,oo)Uj(oo,-oo)} (5.43) 

i.e. we evolve the system from t = — ootot = oo then back to t = — oo and 
thence to t = — oo — i(3. 
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Ci 



l i ► 1 L f 

I, io i— — ' t f -io 

f C 4 ° 2 

t 5 — i B 1 



Figure 5.2: Real-time contour separated into four parts that factorize into 
separate contributions: C\ U C2 and C3 U C4. 



This generating functional allows us to calculate non-equilibrium corre- 
lation functions: j(x,t) can drive the system out of equilibrium since we 
make no assumptions about the final state of the system. The price which 
must be paid is the doubling of the number of the fields in the theory; the 
second copy of each field propagates backwards in time. 

The Keldysh contour which we just described is just one example of a 
possible contour along which the time-evolution is performed. There is a 
more general class of contours, C, which go from — 00 to 00; from 00 to 
00 — ia; from 00 — ia to — 00 — ia; and thence to 00 — i(3. We make the 
choice a = (3/2 for which the propagator takes a particularly simple form; 
however, this is a matter of taste. All choices of a share the advantage of 
being real-time formulations and thereby obviating the need for potentially 
ill-defined analytical continuations. 

There is an important factorization property, which we won't prove here, 
for the contributions from each piece of the contour to the functional integral 
Z: 

Z = Z ClUC2 Z C:iVjCA . (5.44) 

where 



Z = Tr {U {-co — i(3, —00 — ia) U (—00 — ia, 00 — ia) U (00 — ia, 00) Uj (00, — o 
= Tr |r ce - i /o* ff intW J (5. 

Only C\ and C2 are important in obtaining correlation functions. Using 
Dyson's formula for U(tf, U), we can expand Z = Z ClUC2 perturbatively as 
we did in the equilibrium zero-temperature and imaginary-time formalisms. 
We can use Wick's theorem to evaluate T c -ordered products. To construct 
the resulting perturbation theory, it is useful to denote the fields on the 
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upper (Ci) and the lower (C2) pieces of the countour by 

u]{t) = Ui(t), uj(t) = Ui(t- ia), (t = real) (5.46) 

The Feynman rules are similar to those of our equilibrium zero-temperature 
theory with the major difference being each vertex is labelled by an index 
a = 1, 2 and the amplitude is assigned a factor of —1 for each a = 2 vertex. 
The vertex resulting from the source field j(x,t) is assigned a = 1. The 
propagator between two vertices with indices a and b is given by: 

-iA ab (t -t',x- x') =< T c [d i uf(t,x)d j u b j (t',x')} > (5.47) 

where T c denotes ordering of fields according their position along the con- 
tour of Fig. 5.2. For the Keldysh contour, the diagonal elements of the 
propagator are the real-time zero-temperature time- and anti-time ordered 
propagators. The off-diagonal element contains all of the information about 
the occupation numbers of states; in thermal equilibrium the occupation 
numbers are given by n#(o;), but they can be more general. For our choice, 
a = (3/2, the dynamical information contained in the zero-temperature prop- 
agator and the information about occupation numbers can be untangled by 
the parametrization: 

iA(u, k) = u{uj) iA (uj, k) v) (u) , (5.48) 

where 

*.<»,*)-(**»<»•*> _4^), (5.49, 

with iGo(w,fe) the usual time-ordered propagator 

iG (t -t',x- x) = (T[diUi(t, x)d j u j (t', x')]) (5.50) 

and — iGo(w,/c), consequently, the anti-time-ordered one 

-iG* (t-t',x-x') = {{T[d i u i {t,x)d i u i {t',x')})Y = {f[d lUi (t',x')d iUl (t,x)}) 

(5.51) 

The matrix u contains the information about the temperature. This 
matrix is given by 

, . /coshA^ sinhAftA . . „ nT „ T=0 

U ( W >=UhA* w cosh A* J ' where A^ = C " C °, 

and cosh 2 dT = — . (5.52) 

Notice that at zero temperature u = 1. 
To summarize, 
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• To each line, we associate a momentum, p and a frequency, to. 

• To each vertex we assign an index a = 1,2. External lines are assigned 
a = 1 at their free end. 

• For each external line with momentum p, write |p| . 

• The propagator assigned to an internal line carrying momentum p and 
frequency uj which connectes vertices labelled by indices a and b is: 



• For each vertex carrying index a with momenta, frequencies (pi, wi), . . . , (p*4, W4) 
directed into the vertex, we write 



(3 - 2a) g (2ir) 3 d(pi + p 2 + P3 + Pi) <5(^i + w 2 + a; 3 + w 4 ) 

• Imagine labelling the vertices 1,2, ... ,n. Vertex i will be connected 
to vertices ji, ■ ■ ■ ,j m {m < 4) and to external momenta pi, ■ ■ ■ ,P4- m - 
Consider a permutation of these labels. Such a permutation leaves the 
diagram invariant if, for all vertices i, i is still connected to vertices 
ji, . . . , j m (m < 4) and to external momenta pi, . . . ,P4- m - If S is the 
number of permutations which leave the diagram invariant, we assign 
a factor 1/S to the diagram. 

• If two vertices are connected by / lines, we assign a factor 1/Z! to the 
diagram. 

In equilibrium, the Schwinger-Keldysh formalism gives results which are 
identical to those of the Matsubara formalism. Out of equilibrium, however, 
the two formalisms give different results; only the Schwinger-Keldysh is 
correct. 



CHAPTER 6 



Measurements and Correlation Functions 



6.1 A Toy Model 



We will now take a break from our development of techniques for calculating 
correlation functions to relate retarded correlation functions to experimental 
measurements. We will also discuss those properties of retarded correlation 
functions which follow from causality, symmetries, and conservation laws. 

Let us take a look at a toy model to see why retarded correlation func- 
tions are useful objects which are simply related to experimentally measur- 
able quantities. Consider a single damped harmonic oscillator, with equation 
of motion 



By causality, Xvct(t—t') must vanish for t—t' < 0. Substituting the definition 
of Xret(t — f), (6.2) into the equation of motion (6.1), we see that Xict(t — t') 
satisfies: 



-£o Xret(* -0+7 -T + Xret{t ~ *') + ^ Xret (t - if) = 5(t - t') (6.3) 



We define the retarded response function, Xret(* — by 




(6.1) 




(6.2) 
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Thus, Xret(t — f) is the Green function of the differential operator on the 
left-hand-side of (6.1) subject to the boundary condition that Xrct (t-f) =0 
for t - a < 0. 

We can define the Fourier transform of 

/OO /'OO 
dte^x(t) = / dte^ X (t) (6.4) 
oo JO 

Since %(t) vanishes for i < 0, the integral is well-defined for uj anywhere in 
the upper-half-plane. Therefore, x( w ) is an analytic function in the upper- 
half-plane. 

Substituting the Fourier representation of x(t) in the equation of motion, 
we find x{ u ) : 

xM = - — ( 6 - 5 ) 

OJq — or — «7LJ 

We can break x( w ) into its rea l an d imaginary parts: 

xH = x 'H + *x» (6-6) 

From (6.5), we have in our toy model: 

X M 



(wg - w 2 ) 2 + (7^) 2 



x>) = R^FTW (6 - 7) 



From the above definition, 

X »=Im{ dte^x{t) 

K J — OO 

/OO -I 



-oo 

poo | 

Jujt 



r 

j dte^f.^l)- \(-!)) (6.8) 



Similarly, 

/OO -I 
^-(xW + xH) (6-9) 
-oo ^ 

Thus, x"(w) is the Fourier transform of the part of x( w ) which is not in- 
variant under t —t while x'( w ) is the Fourier transform of the part of 
X(^) which is invariant under t — > — t. In other words, x"( w ) knows about 



6.1. A TOY MODEL 



77 



the arrow of time which is implicit in the condition x(t — f) vanishes for 
t — t' < 0. x'i 1 * ), on the other hand, does not. 

This is reflected in the fact that x"^) determines the dissipative re- 
sponse. To see this, suppose that we apply a force, F(t). The work done 
by this force is the energy which is tranferred to the system - i.e. the 
dissipation: 

— - F(t) — 
dt [) dt 



F(t)-J dt'Xret(t-t')F(t') 



oo 



F(t) —e wt iu;x{Lo)F{Lo) 

J-oo 27r 



(6.10) 



If we assume F(t) = Focostlot and compute the zero-frequency part of 
dW/dt (rather than the part which oscillates at 2u>o), we find: 

^> = 0) = i Fi i (n x(^o) - nox(-«o)) 

= ±F$n x"(n ) (6.11) 

The essential reason that x' doesn't enter the dissipation is that the time- 
reversal symmetry of x' implies that the energy gain and loss due to x' are 
the same, x' is often called the reactive part of the susceptibility while x" 
is the dissipative or absorptive part. 

For our toy model, the energy dissipation, 

dt ~ 2 ^ "° (c 2 - n§)2 + ( 7 f> )2 [bA2 > 

is maximum at CIq = ±uio, i.e. on resonance. Consider the resonance ~ 
loq. Approximating Qq + ujq 2ujq, we have a Lorentzian lineshape: 



dW _ 1 2 7 
~df ~ 2 4(^o - ^o) 2 +7 2 



o*0 77T n\ a ,.. a ( 6 - 13 ) 



with full-width 7 at half-maximum. 

As a result of the analyticity of x i n the upper-half-plane - i.e. as a 
result of causality - we have the Kramers-Kronig relations. The analyticity 
of x( z ) i n the upper-half-plane implies that: 

dz x{z) 



qTTI Z — OJ 



(6.14) 
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Figure 6.1: The countour of integration taken for the Kramers-Kronig rela- 
tion. 



for the contour of figure 6.1 The integral of the semicircular part of the 
contour vanishes, so we have: 

f-jfc/ xfr/) _„ 
J_ 00 iri ux + it - oj 

Using the relation: 

tA =V (— — ) -iirS(u/ -u) 



we have 



(6.16) 



J-oc 7T U'-U 



Comparing real and imaginary parts, we have the Kramers-Kronig relations: 

ro ° dw 1 x"(w') 



<» = -pf^4^ (6.18) 



6.2 General Formulation 

We can cull the essential features of the toy model and apply them to a 
many-body system such as our phonon theory. It is fairly clear that this 
can be done for a free field theory which is, after all, just a set of harmonic 
oscillators. As we will see momentarily, this can be done - at least at a 
formal level - for any theory. 
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Suppose our external probe couples to our system through a term in the 
Hamiltonian of the form: 



#probc = J 4>{X, t) f(x, t) 



(6.19) 



f(x, t) is our external probe, which we control, and 4>(x, t) is some quantum 
field describing the system. In our phonon theory, we could take <p(x, t) = 
dkUk(x,t) in which case our probe compresses the solid. Alternatively, we 
could take 4>(x,t) = us(x,t), causing displacements along the 3-direction. 
We will work - as in the last chapter when we defined Green functions - in 
an interaction representation in which -£/p ro be is the interaction Hamiltonian 
and the rest of the Hamiltonian is the 'free' Hamiltonian. Let us suppose 
that we now measure the field rj(x,t), which may or may not be the same 
as 4>(x,t). Its expectation value is given by: 



(V(x,t)) = {0\Uf 1 {t,-<x)ri{x,t)U I (t,-<x)\0) 



(6.20) 



where 



(6.21) 



E7}(t, -oo) = T { e <*WO) 

= l-i [ dt' H pmhc (t') + . . . 

J — oo 

If we keep only terms up to first-order in i/ pro b e , then we have: 

dt' tf pro bc(0) V(x, t) (l - ij dt' tf probc (0) 



(r](x, t)} 



l + i 



(0\ V (x,t)\0) + {0 



dt' [H prohe (t'),r](x,t)] 



= (0\ V (x,t)\0) + i [ [ dt' f(x',t')(0\ [cf>(x',l/),ri(x,t)](j^2) 

We have added a subscript to emphasize that these are interaction picture 
states - i.e. these are expectation values in the absence of the probe. Let 
us assume, as is usually the case, that 



(0\r](x,t)\0) o = 



(6.23) 



Then, 



( V (x,t))=i I d 3 x' f dt' f(x',t')(0\ [c/ ) (x , ,t'),r l (x,t)]\Q) (6.24) 

J J —oo 
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The commutator on the right-hand-side, (0| [4>(x', t'),n(x, t)]\0), is an exam- 
ple of a response function. 

Let us specialize to the case rj(x,i) = cf)(x',t'). Then, we write: 



(rj{x,t)) = -i f d 3 x ! dt' f(x,t')x(x,x;t,t') (6.25) 

If the Hamiltonian is space- and time-translationally in the absence of i7 pro b e , 
then we can write: 

X (x,x';t,t')=x(x-x',t-t') (6.26) 
We can also extend the dt' integral to oo 

/f'OO 

d 3 x' dt' f(x',t') X (x- x',t-t') (6.27) 
J — oo 

if we define 

X {x -x,t-t') = i9{t - t') (0 | [<f>(x, t),<f>(^, t')} | 0) (6.28) 

As in our toy model, we can define the Fourier transform with respect 
to time, x(x — x',uj) and its real and imaginary parts, 

x(x-x',oj) =x'{x-x , ,u J )+ix"(x-x , ,oj) (6.29) 

Following the steps of (6.8), we see that x"(x—x', u) is the Fourier transform 
of the commutator without a ^-function: 

X "(x -x',t-t')= f — e-^'-'V^ " x\ to) (6.30) 
J 2tt 

where 

x "(f _ ^ t _ t <) = (o | [0(x, t), <j>{0, t')] | 0> (6.31) 
As in our toy model, satisfies the antisymmetry properties: 

X "(x -x',t- t') = - X "(x - x, t' - t) 

XV -x,u) = - X "(x - x, -u) (6.32) 

These properties follow from the fact that x" ls a commutator. 

Following the steps which we took in our toy model, the dissipation rate 
under the influence of a periodic probe, f(x, t) = f w {x) cosut is given by 

^ = J d 3 xd 3 xfu X "(x - x,oo)US)Ux) (6-33) 
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We can also follow the derivation for our toy model to show that x(a? — 
x, uj) satisfies the Kramers-Kronig relations (6.18). The Kramers-Kronig 
relations can be approached from a different angle by constructing a spec- 
tral representation for the response function (6.28). Following our earlier 
derivations of spectral representations, we have: 



where 



p(q,E) = Y,\(m\H0,0)\0)\ 2 5(q-p m )5(E-u; m ) (6.35) 



If we construct a spectral representation for x"(<f, oj) - which can be done 
trivially since there are no ^-functions - or simply take the imaginary part 
of (6.34), we see that: 

x"(<7>) = irp(q,u) (6.36) 
In other words, )("{q,uj) is the spectral function for x{q,w): 

X(?,w)= / — -= — - — (6.37) 

J-oo 7T E -UJ -id 

which is the Kramers-Kronig relation. According to (6.37) %(<f, uj) is singular 
whenever x"(g, u) is non- vanishing. These are the regions of phase space 
where there are states of the system with which the external probe can 
resonate, thereby causing dissipation. 

6.3 The Fluctuation-Dissipation Theorem 

Consider the correlation function: 

S^(x, t) = Tr (e~P H r](x, t) 0(0, 0)) (6.38) 
The response function can be expressed in terms of the correlation function: 

X V ct>{x, t) = 6(t) (S V(j) (x, t) - Sfai-x, -*)) (6-39) 
and its dissipative part is simply 

X v <p"{x, t) = (S V( p{x, t) - S H (-x, -t)) (6.40) 

or 

X v /{x, lo) = {S vlP (x, uj) - S^(-x, -uj)) (6.41) 
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By the cyclic property of the trace, 



S v4> (x, t) = Tr (e~ pH rj(x, t) 0(0, 0)) 
= Tr (V(0,0)e _/3H rj(x, t)j 
= Tr (e- pH e^ H <f>(0, 0)e" f3H V (x, t)) 
= Tr (e-P H 4>(0,-iP) V (x,t)) 



= S^i-x, -t - i/3) 



(6.42) 



Hence, 



S v<j) (x,uj) = e^S^-x,-^) 



(6.43) 



Thus, we finally have: 



X v <t>"(x,u) = (l-e ^ S v4> {x,u)) 



(6.44) 



Since the right-hand-side is a measure of the dissipation and the left-hand- 
side is a measure of the fluctuation, (6.44) is called the fluctuation-dissipation 
theorem. As we will see shortly, neutron scattering experiments measure 
S pp (q,uj), and the fluctuation-dissipation theorem relates this to a quantity 
which we can attempt to calculate using the imaginary-time formalism: the 
imaginary part of a retarded correlation function, Xpp '(Qi^)- 

6.4 Perturbative Example 

Let us consider the case in which <j) = r/ = dtu\. dtU\ is the current in the 
xi-direction carried by the ions in the solid, so we are driving the solid in 
the xi-direction and measuring the subsequent flow of the positive ions in 
this direction. Then x<j><j> is given by the retarded correlation function of dfUi 
with itself. Let us make the simplifying assumption that vi = vt- According 
to the iS prescription of (5.32), for free phonons this is: 



In other words, there will only be dissipation if the compressional force has 
a component with lu 2 = vfq 2 . Thus a measurement of this response function 
is a direct measurment of vi, i.e. of the phonon spectrum. 




(6.45) 



Hence, 



xlcj, = sgn{u)5{u 2 - vfq 2 ) 



(6.46) 
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If the phonons are interacting, 

,2 



X<W(W,9) = 7 -TTT^ 2 2 i TT ? \ ( 6 - 47 ) 

(cj + id) 2 - vfq 2 + II ret (u;, (?) 
and the (5-function is broadened: 

x'L = ^im { n ret (c g) } 

( W 2 _ w 2g2 + Re { n ret ( w , ^)}) 2 + (Im {n ret (co, g)}) 2 

Consider a perturbative computation of U re t(co,q). At O(g), there is 
diagram (a) of figure ??. This diagram gives a purely real contribution 



) (6.49) 



In the problem set, you will compute the Matsubara sum. At zero temper- 
ature, we find: 

nf ^ 9 2 [ d3 P P t r>( 2\ 

= (const.) gq 2 A 4 / v + 0(g 2 ) 

= (Svf)q 2 (6.50) 

where (Svf) = (const.) g A A /v. The analytic continuation to the retarded 
Green function is trivial. 

The first contribution to the imaginary part of the self-energy comes 
from diagram (b) of ??. It is given by 

g 2 1 f d 3 pi d 3 p 2 



Im {U(iuj n , q)} = lm{q 2 ^Tp^ f 4^1 4^ $( iuJn ~ ~ iuJn z > 9 ~ Pi ~ P2) 

m,n2 

G(iuJ ni ,Pi)G(iuJn 2 ,P2)} iuJ ^ ul+i5 + 0(g 3 ) (6.51) 



Evaluating this integral is tedious, but we can make some simple observa- 
tions. When we convert the two Matsubara sums to integrals, we will con- 
vert the Green functions to spectral functions; taking the imaginary part 
will convert the final one to a spectral function: 

Im{n^ n ,,)} = Im{g 2 ^ J dun J du> 2 J ^ ^ Q B B}^ +i5 + 0(g 3 ) 
= Im{g 2 |- Jdu* Jdu 2 J ^^BBB}^ +tS 
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+0(g 3 ) (6.52) 

By taking the imaginary part, we have put the three internal phonon lines 
on-shell. There is no phase space for the (5-functions to be satisfied if u = 
(since p\ and f>2 will have to be collinear), so the integral is propotional to 
to. Hence, we have: 

Im {U(iu n , q)} = Dq 2 u + 0{q A u) + 0(g 3 ) (6.53) 

where D = (const. )g 2 A 7 . Keeping only the first term, we can now write the 
spectral function as: 



jj- {Dq 2 co) 



(6.54) 

{oj 2 - vfq 2 ) 2 + (Dq 2 Ljf 

where vf = vf + 5vf . 

This is the form of the response function which we expect when g is small 
and TL re t(uJ,q) can be calculated perturbatively. In such a case, the correc- 
tions due to H ret (uJ, q) are small and lead to a small damping of a propagating 
mode. In general, however, the calculation of the response function, x-, ls a 
difficult problem. Nevertheless, we can often say something about \ since 
some of its general features follow from conservation laws and symmetries. 
The resulting equations satisfied by physical quantities (including response 
functions) are hydrodynamic equations. 



6.5 Hydrodynamic Examples 

Let us consider as an example some particles dissolved in a fluid. The 
density, p, and current, J of these particles will satisfy a conservation law: 

lLp + V.j = (6.55) 

Unlike in the case of a propagating mode, p and J M will satisfy a constitutive 
relation: 

J = -DVp + f cxt (6.56) 

where D is the diffusion constant and B cxt is an external force acting on the 
particles (such as gravity). Ideally, we would like to compute D (perturba- 
tively or by going beyond perturbation theory), but in many cases we must 
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leave it as a phenomenological parameter. As a result of the constitutive 
relation, p satisfies the diffusion equation: 



Hence, 



^p-DV 2 p = V-^xt (6.57) 



X PP (u,q) = . l — 2 (6.58) 



and 

w 2 + (IV) 

Thus, x'pp/Q is a Lorentzian centered at oj = with width Dg 2 . Similarly, 



Xp P (^,g)= „ , 1 on2 (6-59) 



and 



Note that this is precisely the same as (6.54) for v = 0. 

In this example, we have seen how the low q,co behavior of response 
functions of conserved quantities and their associated currents can be deter- 
mined by a knowledge of the hydrodynamic modes of the system. In general, 
there will be one hydrodynamic mode for each conservation law satisfied 
by the system. The conservation law, together with a constitutive relation, 
leads to hydrodynamic equations satisfied by the conserved quantity and its 
current. These equations, in turn, determine the correlation functions. Note 
that such constraints usually only hold for conserved quantities; correlation 
functions of arbitrary fields are typically unconstrained. 

Observe that (6.61) is of precisely the same form as (6.54) above, but 
with vi = 0. In this section and the last, we have seen how response func- 
tions which are calculated perturbatively (as we imagined doing in the first 
example) are often of the same form as those which are deduced from the 
hydrodynamic - or long- wavelength, low-frequency - equations which they 
satisfy. Hydrodynamic laws hold in the long- wavelength, low-frequency limit 
in which local equilibrium is maintained so that constitutive relations such 
as (6.56) hold. Linear response theory holds in the limit of small / cxt . 
When both of these conditions are satisfied, we can sometimes perturba- 
tively calculate response functions which satisfy the constraints imposed by 
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hydrodynamic relations. In chapter 7, we will see examples of this in the 
context of spin systems. 

In a solid, we might be interested in the response functions for the energy 
density and the mass density. It turns out that these quantities are coupled 
so that the "normal modes" of the solid are a combination of the energy 
and mass. One of thse normal modes diffuses while the other is a (damped) 
propagating mode. Consequently, \pp-> XpE, arid xee are given by linear 
combinations of functions of the form of (6.54) and (6.59). 



6.6 Kubo Formulae 



Transport measurements fit naturally into the paradigm of linear response 
theory: a weak external probe - such as a potential or temperature gradient 
- is applied and the resulting currents are measured. Transport coefficients 
relate the resulting currents to the applied gradients. These coefficients - 
or the corresponding response functions - may be derived by following the 
steps of section 2. 

We have already encountered one example of a transport coefficient 
which can be obtained from a response function, namely the diffusion con- 
stant, D, which can be obtained from (6.54) or (6.61) by: 



(6.62) 



To see why transport properties should, in general, be related to such 
limits of response functions, let us derive the corresponding relation, or 
Kubo formula, for the electrical conductivity of a system. Let j denote the 
current in our condensed matter system when the external vector and scalar 
potentials, A and p, are zero. Let p be the charge density. Then, when 
we turn on the electromagnetic field, the current is given by J = j — ^-A. 
Meanwhile, -£f pro be is given by: 

H prohc = J d 3 x (-p(x,t)<p(x,t) + j(x,t) ■ A(x,t) + -^p(x,t)A 2 (x,t^j 

(6.63) 

Following our derivation of the response function in section 2, we have 



J(x,t)) = (0 



l + i 

J\x, t) 



[ dt' H piohe (t')\ J{x,t) (l-i [ dt' H prohc (t'] 
H piohe (t'),J(x, t) 



o> o + < 



/ / dt' 

oo 





(6.64) 
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Using 



J 



()) o = (0 



ne 

0) A 

o m 



ne 



m 



A 



(6.65) 



and the expression (6.63) for H pro ^, c and keeping only terms linear in A, we 
have: 

(J i (x,t)) = i I d 3 x' f dt' [j J (x , ,t'),j t (x,t)]A i (x , ,t') 

J J — CO 

- i f d 3 x' f dt 1 [ji(x", f),p(x, t)] <p(af, if) 

J J — oo 



ne 



m 



Ai(x, t) 



(6.66) 



We are free to choose any gauge we want, so let's take ip = gauge. Then, 

(Ji(x,t))=i J d 3 x' y°° dt'6(t-t') [j j (x , ,t'),j i (x,t)] Mx^f) - ^A(x,t) 

(6.67) 

In this gauge, E = dA/dt, so we naively have: 

//*00 
d 3 x dt' Oij{x -tf,t- t') E^x 1 , t') 
J —oo 



or 



(Ji(q,uj)) = (Jij{q,u) Ej{q,u)) 



with 



1 r°° 1 ne 2 
at, (q, u) = - dt e- wt \jji-q, 0) , ji (q, t)] - 

In terms of the response function, 

1 1 ne 2 
aij(q,uj) = —Xjj(q,u) - 



(6.68) 
(6.69) 

(6.70) 
(6.71) 



The first term on the right-hand-side leads to the real part of the conduc- 
tivity: 



1 



Reaij(q,uj) = -x"j(q,w 



(6.72) 
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while the second term - if it's not cancelled by the imaginary part of the 
first term - leads to superconductivity 

The DC conductivity is obtained by taking the q — > limit first, to set 
up a spatially uniform current, and then taking the DC limit, to — ► 0. 



If we take to — ► 0, then we'll get a static, inhomogenous charge distribution 
and the q — > limit won't tell us anything about the conductivity. 

The above formulas are almost right. The problem with them is that 
they give the response to the applied electric field. In fact, we want the 
response to the total electric field. Using Maxwell's equations and our linear 
response result for J, we can compute the total field and thereby find the 
correction to (6.73). This issue is most relevant in the context of interacting 
electrons, so we will defer a thorough discussion of it to that chapter. 

6.7 Inelastic Scattering Experiments 

Another way of experimentally probing a condensed matter system involves 
scattering a neutron off the system and studying the energy and angular 
dependence of the resulting cross-section. This is typically (but not exclu- 
sively) done with neutrons rather than photons - for which the requisite 
energy resolution has not yet been achieved - or electrons - which have the 
complication of a form factor arising from the long-range Coulomb interac- 
tions. 

Let us assume that our system is in thermal equilibrium at inverse tem- 
perature (3 and that neutrons interact with our system via the Hamiltonian 
H' . Suppose that neutrons of momentum fcj, and energy cjj are scattered by 
our system. The differential cross-section for the neutrons to be scattered 
into a solid angle dQ, centered about kf and into the energy range between 
LOf ±duj is: 




(6.73) 



dQ dto 




kf;m\H'\ki;n\ e 



^ S (cj + E n - E m ) (6.74) 



where to = oji — LOf and n and m label the initial and final states of our 
system. 

For simplicity, let us assume that there are simple 5-function interactions 
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between the neutrons and the particles in our system: 

H' = V^5(x -Xj)=V p(x) (6.75) 

3 

Then 

(k f -,m\H'\ki-,n) = V J S'xe 1 ^ (m\p(x)\n) 

= V (m\p(q)\n) (6.76) 

If we use the Fourier representation of the J-function, 

1 f 00 

8(lo + E n -E m ) = — dt ji"+E n -E m )t (6 J7) 

and pass from the Schrodinger to the Heisenberg representation, 

e i(E n -E m )t ( m |^| ^ = ( n \ e iHt p ^ e -iHt\ m > 

= (n\p(q,t)\m) (6.78) 

then we can rewrite (6.74) as 

m * v 7 

We can now use \m) (m\ = 1 and write this as 

If we define the dynamic structure factor, S(q,to) = S pp (q,tu): 

S(q,cj) 5(lj + J) = Tr {e-? H p(q,Lu)p(-q,Lu')} (6.81) 
then we can write the inelastic scattering cross-section as: 



dQ dto 2tt hi \ 2n J 
According to the fluctuation-dissipation theorem, this can be written 



d 2 a 1 k f fM\ 2 ( 1 



dQ dto 2?r h V 2?r J V 1 - 



x"(g>) (6.83) 
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In our elastic theory of a solid, 

p(x) =Y^s(x-Ri- «(£)) (6.84) 

i 

p (g) = £V*(fli-a(fli)) ( 6 . 8 5) 

i 

Let us assume that the displacements of the ions are small and expand the 
exponential, 

i 

~ J2S(q~ Q) ~ iq- u(q) (6.86) 
Q 

where Q is the set of reciprocal lattice vectors. By dropping the higher-order 
terms in the expansion of the exponential, we are neglecting multi-phonon 
emission processes. Hence, the scattering cross-section is given by the sum 
of the contributions of the Bragg peaks together with the contributions of 
one-phonon emission processes: 

+ [Tr|e _/3H q-u(q,u)q-u(-q, 

(6.87) 

Recognizing our longitudinal phonon Green function on the right-hand-side 
and using the fluctuation-dissipation theorem, we can write this as: 

Im{G rct (g,w)} 
(6.88) 

Hence, the quantity which our imaginary-time perturbation theory is de- 
signed to compute - G iet (q,u>) is precisely the quantity which is measured 
in inelastic scattering experiments. If we assume a self-energy as we did in 
(6.54), then there will be Lorentzian peaks at uj = ±viq of width Dq 2 . 

6.8 Neutron Scattering by Spin Systems-xxx 

6.9 NMR Relaxation Rate 

In nuclear magnetic resonance, or NMR, experiments, a material is placed 
in a constant magnetic field. As a result of this magnetic field, there is 



d 2 a _ 1 k f (M 
d£l dio 2-7T k{ \ 2-7T 



+ 
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an energy splitting ujq between the up-spin excited state and the down- 
spin ground state of the nuclei (let's assume spin- 1/2 nuclei). If an up-spin 
state were an energy eigenstate, then electromagnetic radiation at frequency 
loq would be perfectly resonant with the nuclear spins; the absorption cross 
section would have a 5- function at loq. As a result of the interaction between 
nuclear spins and the other excitations in the system (electrons, phonons, 
magnons), the up-spin state has a finite lifetime, T±. The width of the 
resonance is, therefore, 1/T\. A measurement of T\ is an important probe 
of the spin-carrying excitations of a system. 

The interaction Hamiltonian for the coupling between a nuclear spin and 
the other excitations is: 

Him = / (0 MQ) II+S-(Q) + I-S+(q)] (6.89) 

A(q) is the hyperfine coupling between the the nuclear spin / and the spin 
density S(q) due to the excitations of the system. The lifetime of the up-spin 
state is given by: 

^ = J2K^ m \ H '\ V,n)\ 2 e-P E "5(uJv + E n -E m ) (6.90) 



Following the steps which we used in the derivation of the scattering cross- 
section, we rewrite this as: 



e -PE n 



T 1 = J (2^ A(g W oo ^^+ B «-^)^(n| 1 S_(^)|m)(m|5 + ( 9 )|n) 
= / 7^^)^r^e- ot ^(n|5_(-g,0)|m)(m| 1 S + (g,t)|n)e-^ 
= I ^A{q)Y,{n\S-{-q-uJo)\m){m\S + {q^)\n)e-^ 
= / (02^)E^I 5 -(-9'- w o)^ + (9^o)|n) e -^ 
= f 70 A («) Tr S-(-q,-u, )S + (q,u; Q )} (6.91) 



or, using the fluctuation-dissipation theorem, 



(6-92) 



Ti J (2tt) 2 w i-e-puo 
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u>0 is usually a very small frequency, compared to the natural frequency 
scales of electrons, spin waves, etc., so we can take uq — ► 0: 

1 Sw? A ^^^- { -^ (6 - 93) 



TiT 



CHAPTER 7 



Functional Integrals 



7.1 Gaussian Integrals 

We will now shift gears and develop a formalism which will give us a fresh 
perspective on many-body theory and its associated approximation meth- 
ods. This formalism - functional integration - will also reveal the under- 
lying similarity and relationship between quantum and classical statistical 
mechanics. 

Consider the Gaussian integral, 



This integral is well-defined for any complex a so long as Re{a} > 0. We 
can generalize this to integration over n variables, 



and even to integration over complex variables z% with d n zd n z* = d n (Rez)d n (lmz), 




(7.1) 




(7.2) 




(7.3) 



so long as Ay is a symmetric matrix with Re{^4jj} > 0. 
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By completing the square, 

^ XiA ijXj + b iXi = \{xi- x°) Aij ( Xj - x°) - i bi (A'^.Mj (7.4) 

where x? = (A _1 )jj&j, we can do the integral of the exponential of a 
quadratic form: 

J d n xe- 1 2 XiAi ^ +biXi = (27r) n/2 (detA)' 1 / 2 b< ( A ~%^ (7.5) 

By differentiating with respect to bi, we can also do the integrals of polyno- 
mials multiplying Gaussians: 

J d n xP(x u ...,x n ) e ~^ A ^+ b ^=p(J^,...,-^j ^( 27 r)"/ 2 (detA)-VM 6< ( A ~% 6 ^ 

(7.6) 

A non-Gaussian integral can often be approximated by a Gaussian inte- 
gral using the saddle-point approximation: 

jd n xe' x ^ « J dnxe~ Xf ^~ m9jf) ^ O^X^-*?) 

= (2^/ 2 e -V(*?) (drf ( Aftfl,-/)^) "* (7.7) 

Where x° is a stationary point of /(xj), i.e. djf(x®) = for all j. This 
approximation is good in the A — > oo limit where the minimum of /(xj) 
dominates the integral. 

Nothing that we have done so far depended on having n finite. If we 
blithely allow n to be infinite (ignoring the protests of our mathematician 
friends), we have the Gaussian functional integral. In the next section, we 
will do this by making the replacement i — > t, Xj — ► x(t), and 

j d n xe-^ A ^ ^ J Dx(t)e~ St ? dt * x( - t) (~*p) x ® (7.8) 

As we will see, the generating functional, Z, can be expressed in this way. 
Such an expression for the generating functional will facilitate many formal 
manipulations such as changes of variables and symmetry transformations. 
It will also guide our intuition about quantum mechanical processes and 
emphasize the connections with classical statistical mechanics. 



7.2. THE FEYNMAN PATH INTEGRAL 
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7.2 The Feynman Path Integral 

In this section, we will - following Feynman - give an argument relating the 
matrix elements of the evolution operator, U, of a free particle to a Gaussian 
functional integral. This derivation can be made more or less rigorous, but 
shouldn't be taken overly seriously. We could just as well write down the 
functional integral without any further ado and justify it by the fact that it 
gives the same result as canonical quantization - ultimately, this is its real 
justification. Here, our reason for discussing Feynman's derivation lies its 
heuristic value and its intuitive appeal. 

Suppose that we have a particle in one dimension moving in a potential 
V(x). Then 



Then the imaginary-time evolution operator is given by 

U(t f ,t i ) = e-^f^ H 

We would like to compute 

(xf\U(Tf,n)\ Xi) 
In order to do this, we will write (the Trotter product formula) 



-(r f -n)H 



-StH 



(7.9) 
(7.10) 
(7.11) 

(7.12) 



in the limit 5r = (jf — Ti)/N T — > 0. We will now take the desired matrix 
element (7.11), 

(7.13) 



x f 



e SrH _ _ _ e -8rH 



and insert a resolution of the identity, 

dx\x)(x\ - 1 

between each factor of e~ SrH : 



(7.14) 



-StH 



J ... J dX\. ..dXN T -! (xf 



-StH 



XN T -1 



Xi 



^StH 



(7.15) 

When we repeat this derivation for a quantum mechanical spin in the next 
chapter, we will insert a resolution of the identity in terms of an overcomplete 
set of states. As a result, the integration measure will be non- trivial. 
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We now make an approximation which is accurate to 0(5t 2 ). In the 
St — ► limit, we will have an exact (though formal for arbitrary V(x)) 
expression for the desired matrix element. Observe that 



Hence, we can write 



(7.16) 



-StH 



%n— 1 



-5r(y(x„_i)) 



+ Q(5t 2 ) (7.17) 



We now insert a complete set of momentum eigenstates into the right- 
hand-side of this expression: 



-St 2— 

e 2m 



x n -i ) e 



-St V(x n -i) 



I 



dp n (x n \Pn) (P 



dp n e' 



-St X— 

e 2m 



Pn(x n — Xn-l) p — St 



2m e 



Sn-iye-*-^"- 1 ) 
(7.18) 



Note that the second line is an ordinary integral of c-numbers. Doing the 
Gaussian p n integral, we have 



*(f(™) 2 +^- 



i) 



(7.19) 



Hence, 



(x f \U(r f ,n)\ Xl ) = J ...J dx 1 ...dx Nr - 1 e ^r(f(^^) + v ^-,)) 

(7.20) 

In the St — > limit, we write 



nJr — > r 



E 5t 



x(t) 
rir 



(7.21) 



(but we make no assumption that x(t) is differentiable or even continuous) 
and 

,—Se 



(xf\U(Tf,Ti)\xi) = J Vx{t)x 
where Se is the imaginary-time - or 'Euclidean' - action: 



(7.22) 



(7.23) 
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The path integral representation suggests a beautiful interpretation of the 
quantum-mechanical transition amplitude: the particle takes all possible tra- 
jectories with each trajectory x(t) contributing e~ SE ^ T ^ to the amplitude. 

If the particle is free or is in a harmonic oscillator potential, V(x) = 
mw 2 x 2 /2, this is a Gaussian functional integral: 

Vx(r) e-ti dT 2 m (f )M m ^ 2 = Nde t(-^ + ^ (7.24) 

where the determinant is taken over the space of functions satisfying x(ti) = 
Xi, x(tf) = Xf and A is a 'normalization constant' into which we have 
absorbed factors of m, ir, etc. 

For a more general potential, the path integral can be defined perturba- 
tively using (7.6) 




Time-ordered expectation values can be simply handled by the path- 
integral formalism: 

(x f \T (x(n) . . . x(r n ) U(r f , n))\ Xl ) = j Vx(t) x(n) . . . x(r n ) e~^ dr 

(7.26) 

As you will show in the problem set, this follows because the T-symbol 
puts the operators in precisely the right order so that they can act on the 
appropriate resolutions of the identity and become c- numbers. 

7.3 The Functional Integral in Many-Body The- 
ory 

Instead of following the steps of the previous section to derive the functional 
integral for a field theory, we will simply demonstrate that the generating 
functional, Z\j], is given by 3 : 

Z[j]=N Jvue~ SE[j] (7.27) 

where SeU] is the imaginary-time - or Euclidean - action in the presence 
of an external source field j(x,r) and A is a normalization factor. 
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To show that this is true, we will first show that it is true for a free field, 
i.e. g = in our phonon Lagrangian. The generating functional, Z[j] is 
given by the exponential of the generating functional for connected Green 
functions, W[j], which, in turn, is given by a single diagram: 

Z [j] = e Wo[j] = e 5/j'WGo(x-y)j(2/) z ^ ( 7>28 ) 

The functional integral, on the other hand, is given by 

N jvu e~ s ^ = N [det (~Mr " (A* + VWj ~ ^ijd k d k )] ~^ I j(x)G (x~y)j( y ) 

(7.29) 

which follows because 

G = OA (Sijdt - + X)d l d j - fiSijdkdk)- 1 (7.30) 

If we choose N to cancel the determinant, we have the desired result. 

Now consider the interacting case, S = Sq + J Ci n t{dkUk)- Then, follow- 
ing (7.6), we can write the functional integral as: 

W / P «^/^<W_£ e -/M*)e™ (7.31, 

According to Dyson's formula, we have precisely the same thing for the 
generating functional. Hence (7.27) is true even for an interacting theory. 
By straightforward extension, we can show that the same relation holds at 
finite-temperature, where imaginary-time integrals run from to (3. 

An important result which follows from this discussion is that the propa- 
gator is simply the inverse of the differential operator in the quadratic term 
in the action. The inverse is almost always most easily taken in momentum 
space. 

With the functional integral representation of Z[j] in hand, we can give 
simple proofs of Wick's theorem and the Feynman rules. To do this, it's 
helpful to use the following identity (which we state for finite-dimensional 
vector spaces) 

-4) G < x »=( G (- ! l) F(y)e ' xy )^ (7 - 32) 

This identity can be proven by expanding F and G in plane waves. 

F(x) = e ia ' x 

G(y) = e lb ' y (7.33) 



7.4. SADDLE POINT APPROXIMATION, LOOP EXPANSION 
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The left-hand-side of (7.32) is 



gia-9/cbCgib-x gib-(x+a) 



while the right-hand-side is: 



e ih-d/dy e i(x+a)-y _ g i(x+a)-(y+b) 



(7.34) 



(7.35) 



from which (7.32) follows. 

Using this, we can compute the time-ordered product of a string of fields 
in a free field theory: 

S n Z\j] 



(0\T(d kUk (xi).--d k u k (x n ))\0) 



Sj(xi)...Sj(x n ) 
5 5 



| f j(x)G (x-y)j(y) 



Sj(xi) Sj(xi) 
j/ G »N)«S 



Jj=0 



( a iUiW) 5( d] u j( y)) Q m ( Xl ) . . . d iUi (x n ) 



J Ui=0 

(7.36) 



This is Wick's theorem in the form in which we rewrote it at the end of 
section 4.4. 

In the same way, we can derive the Feynman rules for an interacting 
theory. Using our expression (7.31), 



Z[j] =Ne~ JCmt (^ e w °W 



= N 



(7.37) 



Using our identity, (7.32) we can rewrite this as: 

Z[j] = N G °( x ~ y ^ sd k u k (x) 6djtj(v) e - / c int (d k u k )+jd k u k (7.38) 

which is a compact expression of the Feynman rules for the generating func- 
tional. 



7.4 Saddle Point Approximation, Loop Expansion 

As we pointed out in chapter 5, the loop expansion is an expansion in powers 
of h. Using the functional integral, we can obtain another perspective on 
the expansion in powers of h. Restoring the h, the functional integral is 

Z[j]=N [ Vue-^( SE[u]+ -f jdkUk ) (7.39) 
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At saddle-point level, this is given by: 

Z[j] = Ne-*( SE[uC]+ f j9kU ti (7.40) 
where u c is a classical solution of the equation of motion: 

Kij(x - y) Uj (y) - ^di{d k u k f = -dd(x) (7.41) 

and 

Kij(x - x') = 5{x - x') (Sijd* + + Xjdidj + fidijdkdk) (7.42) 

If we use the classical solution which is obtained by starting with the g = 
solution: 

u t {x) = -K^( x -y)d ] j{y) (7.43) 
and solve this iteratively: 

Ui(x) = -K^\x-y)d j j(y)+^K-\x-x')d l {d k K-l(x' - x")djj{x")f +. . . 

(7.44) 

we obtain: 

W[j] = lnZ[j] = l(ldd(x)Kj(x - y)d J j{y) 

- l - di j{x)^K-\x - x'W^K-Hx' - x")d j j{x")f + ...) 

(7.45) 

This is the contribution to the generating functional for connected diagrams 
coming from tree-level diagrams. In the terms not shown, for each additional 
vertex carrying a g we have one extra internal line - i.e. a propagator K^ 1 
which is not attached to a dij. 

The Gaussian fluctuations about the saddle point contribute to the func- 
tional integral a factor of 

det (- Kij + |a,a,(a fc 4) 2 ) = ^K-^+§^(^) 2 ) {7M) 

This gives the following contribution to the generating functional of con- 
nected diagrams: 

Tr In [-Kij + \d i d j {d k u c k ) 2 ') = Tr In (-K tj ) + Tr In (l - KJ^d i d j {d k u c k ) 2 ') 

= Tr In {-Kij) - Tr (i^f^^) 2 ) + 
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The first term is independent of j(x, t) and can be absorbed in the normal- 
ization. The rest of the series gives the connected one-loops contributions. 
The second term is the loop obtained by connecting a point to itself with 
a propagator; the u£'s attach all possible tree diagrams to this loop. The 
third term is the loop obtained by connecting two points by two propagators 
and again attaching all possible tree diagrams. The next term (not written) 
is the loop obtained by connecting three points with three propagators, and 
so on. 

Hence, the tree-level diagrams give the 0(1/ h) contribution to W[J] 
while the one-loop diagrams give the 0(1) contribution. To see that the 
L-loop diagrams give the 0(h L ~ v ) contribution to W[J], observe that each 
propagator comes with a factor of h (since it is the inverse of the quadratic 
part of the action) while each vertex comes with a factor of 1/h (from the 
perturbative expansion of e s / h ). Hence a diagram with / internal lines and 
V vertices is 0(h I ~ v ). According to the graphical argument we gave in 
chapter 5, I — V = L — 1, which proves the claim. 

Note that we have chosen a particular saddle-point, uf, namely the 
one which can be obtained by solving the classical equations perturbatively 
about g = 0. In principle, we must, of course, sum the contributions from 
all saddle-points of the functional integral. 



7.5 The Functional Integral in Statistical Mechan- 
ics 

7.5.1 The Ising Model and (/? 4 Theory 



(7. 



or, writing out the traces, 




+ 



d 3 xdtd 3 x'dt'(K-^(x' - x,t' - t)^-d m di(d k u c k (x,t)) 2 x 
K-\x -x',t- t'f-did^dkuKx'J)) 2 ) + ... (7.48) 



The functional integral representation of the generating functional of a quan- 
tum mechanical many-body system bears a strong resemblance to the parti- 



102 



CHAPTER 7. FUNCTIONAL INTEGRALS 



tion function of a classical statistical mechanical system. Indeed the formal 
similarity between the two allows us to use the same language and calcula- 
tional techniques to analyze both. To see the correspondance, let's consider 
the Ising model, 

H = -^J ij a i a j (7.49) 

where the spins <jj = ±1 lie on a lattice. The classical partition function is: 

Z = ^\ —>-... 7 " ' : . (7.50) 

We can introduce auxiliary variables, ipi, to rewrite this as: 

Z = N Y1 fvip i e^ iai e~^ Ei ^ J ~ 1 ^i' Piipj (7.51) 

The sum over the cjj's can be done, giving: 

z = nJ2 [ v<Pi e -?^( J ~%^ + ^ lncosh ^ (752) 

If Jij = J(i — j), then the first term in the exponential can be brought to a 
more convenient form by Fourier transforming: 

/dfiq 1 
J^jd JT^^lM-q) (7-53) 

The momenta are cutoff at large q by the inverse lattice spacing. If we are 
interested in small q, this cutoff is unimportant, and we can write 

J(q) = Jo~ \j2Q 2 + 0(q 4 ) (7.54) 
so long as J(i — j) falls off sufficiently rapidly. Hence, we can write 

In the second line, we have gone back to real space and taken the continuum 
limit, <pi — > (f(x). Expanding the other term, 

In cosh Vi = In 2 + \p 2 - —<p 4 + O (if 6 ) (7.56) 
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If we neglect the 0((V<^) 4 ) and higher gradient terms - which seems rea- 
sonable in the small q limit - as well as powers of ip higher than the quartic 
term - which is not obviously the right thing to do, but later will be shown 
to be reasonable - then we can write the partition function as the following 
functional integral. 



where K = TJ2/2.JQ, r = T/Jq — 1, u = 2. Hence, the imaginary-time 
functional integral which we have introduced for the generating functional 
of quantum-mechanical correlation functions is analogous to the classical 
partition function. The weighted sum over all possible classical histories 
is in direct analogy with the sum over all classical configurations. The 
similarity between the functional integral (7.57) and the functional integral 
for our theory of interacting phonons allows us to immediately deduce its 
Feynman rules: 

• Assign a directed momentum to each line. For external lines, the 
momentum is directed into the diagram. 

• For each internal line with momentum q write: 



• For each vertex with momenta p±,...,p4 directed into the vertex, 



• Imagine labelling the vertices 1,2, ... ,n. Vertex i will be connected 
to vertices ji, ■ ■ ■ ,j m {fn < 4) and to external momenta pi, ■ ■ ■ ,p4_ m . 
Consider a permutation of these labels. Such a permutation leaves the 
diagram invariant if, for all vertices i, i is still connected to vertices 
ji, ■ ■ ■ ,j m {jn < 4) and to external momenta pi, ■ ■ ■ ,P4- m - If S is the 
number of permutations which leave the diagram invariant, we assign 
a factor 1/5 to the diagram. 

• If two vertices are connected by I lines, we assign a factor l/U to the 




(7.57) 




write: 



u (2n) d 5(pi +P2+P3+ Pi) 



diagram. 
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In classical equilibrium statistical mechanics, time plays no role, so there 
are momenta but no frequencies (unlike in quantum statistical mechanics 
where, as we have seen, statics and dynamics are intertwined). So long as 
the system is rotationally invariant, the theory will have Euclidean invari- 
ance in d spatial dimensions, and therefore be formally the same as the 
imaginary-time description of a quantum system in d — 1-spatial dimensions 
(and one time dimension). 4 The classical analog of a quantum system at 
finite-temperature is a classical system which has a finite extent, j3, in one 
direction. Much of what we have to say about quantum-mechanical many- 
body systems can be applied to classical systems with little modification. Of 
course, classical statistical mechanics should be contained within quantum 
statistical mechanics in the small (3 limit. 



7.5.2 Mean-Field Theory and the Saddle-Point Approxima- 
tion 

We can apply the saddle-point - or 0-loop - approximation to our functional 
integral (7.57). The classical equation of motion is: 

-KV 2 tp(x) + np(x) + |</? 3 (x) = (7.58) 

Let's look for spatially uniform solutions (f(x) = if. If r > 0, there is only 

<p = (7.59) 
However, for r < there are also the solutions 



<p = ±d (7.60) 

V u 

These latter solutions have larger saddle-point contribution: exp(3r 2 /2u) 
compared to 1 and are therefore more important. 

r = occurs at T = T c = Jo. For T > T c , there is only one saddle-point 
solution, if = 0. According to the saddle-point approximation, at T = T c , 
a phase transition occurs, and for T < T c there is spontaneous magnetiza- 
tion in the system, ip ~ ±\/T c — T. Of course, we shouldn't stop with the 
saddle-point approximation but should include higher-loop processes. In the 
problem set, you will find the Ginzburg criterion which determines whether 
higher-loop processes invalidate the saddle-point analysis. 

The saddle-point analysis reproduces and is completely equivalent to the 
standard mean- field-theory. To make this more obvious, let's use the full 
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105 



potential (7.56) rather than the one truncated at quartic order: 

Z = N J Vipe ~ f^ih^f+^-y-mcos^) (7 61) 

The saddle-point equation is: 

T 

— (p = tanhip (7.62) 

•A) 

This is the usual self-consistency condition of mean-field theory which pre- 
dicts T c = Jq. 

For purposes of comparison, let's recapitulate mean-field theory. We 
replace the effective field which each spins sees as a result of its interaction 
with its neighbors, 

H = - -^Jijaidj (7.63) 

i,j 

by a mean-field, h: 

H = -J2hai (7.64) 

i 

with h given by 

h = y^Jij{(Tj) = JojcTj) (7.65) 

i 

In this field, the partition function is just 2 cosh ^ and 

(a) = tanh ^ (7.66) 
Using the self-consistency condition, this is: 

(.) = tanh^ (7.67) 
which is the saddle-point condition above. 



7.6 The Transfer Matrix** 
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Part III 

Goldstone Modes and 
Spontaneous Symmetry 
Breaking 
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Spin Systems and Magnons 



8.1 Coherent-State Path Integral for a Single Spin 

Let us follow Feynman's derivation of the functional integral to formulate a 
functional integral for a quantum-mechanical spin. A quantum mechanical 
spin of magnitude s has a 2s + 1-dimensional Hilbert space of states. One 
basis is: 

S Z \ S Z ) = S Z \S Z ) , S Z = -8,-8+1,. ..,8-1,8 (8.1) 

For the functional integral representation, it is more convenient to use the 
overcomplete coherent state basis 

s-n\n) = s\n) (8.2) 

where (i is a unit vector. For s = 1/2, we can write this basis in terms of 
the spinor z a , a = +1/2: 

= z a \a) (8.3) 

where: 

(1 = z * a a afS z 13 (8.4) 
in terms of the spherical angles 6, (f> of Q 

A^/ 2 cosf\ 
Z ~ Ve^srnf J ^ 

109 
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and z* a z a = 1. Of course, there is arbitrariness in our choice of the overall 
phase of z a , but so long as we choose a phase and stick with it, there is no 
problem. Therefore, the states and have overlap: 

= z'* a z a (8.6) 

To obtain larger s, we can simply symmetrize 2s spin— 1/2's: 

\n) = z a ' ...z a ^\a u ...,a 2s ) (8.7) 

with 

(n'\n) = {z'* a z a ) 2s (8.8) 

In terms of the spherical angles, 

<z|0> = Q(l + cos0)e-^ 

= Q (l + « • fl) e"*y (8.9) 

Hence the general relation is: 

<0'|0> = Q (l + ti' ■ ti) e"^ (8.10) 

where is the phase of z'* ^". In this basis, the resolution of the identity 
is given by: 

/= ^£±1 y d 2 d|fi)(ji| (8.H) 

as may be seen by taking its matrix elements between states (s\ and \s — n). 
The usefulness of this basis lies in the following property: 

(n\f(s)\ti) = f( s n) (8.12) 

To see this, use (8.7) to write this as: 

(n\f(S)p) = z*^ . . . z*^{/3 1 , . . . ,fc s \f + . . . + ^) . . . z^\ ai , a 2s ) 
= f(sti) (8.13) 

where we have used Q = z* a a a p z 13 in the second line. 
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Let us construct the functional integral representation for the partition 
function of a single spin. Following our derivation of the path integral in 
chapter 8, we write the imaginary-time evolution operator as 



,-PH{S) 



-AtH(S) 



N 



where NAt = (3. Then we can write the partition function as: 



Tr j(V A ^)) 
Taking At — > 0, we have 



N 

n 

i=i 



2s + 1 , 2 



47T 



-AtH(S) 



(8.14) 



(8.15) 



i+l 



-AtH(S) 



a 



e -Ar/((4)- S i(Ar) 2 (g) +^r^ +1 -< + V,° ) ( 8 .1 6 ) 



The second term in the exponent was obtained by making the approximation 

(i + n> . n) s = (2 - \ (n - ff) 2 ) * « 2 s (i - i(Ar) 2 ) J « 2 s e -^mT 



while the third term follows from 



e a0 _ ^sin, 



= eM 



a. -.a. ~*a ~ol 

Z i + 1 Z i+l Z i 



(8.17) 



(8.18) 



Hence, we have 



Tr l( e -ArH(S)\ 



N 



Ar^O / 11 47T 



(8.19) 

The (At) 2 term can be dropped and we can write the partition function as 



Tr{e-P H ®}=N J DQ(r)e 



where 



,dz a 



S SP uM = I dr ( H(sn) - 2sz* a — 



(8.20) 
(8.21) 



In terms of the spherical angles, the second term can be written 
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is / d(f> cos 



> /# ( 1 " f 

J \ J cos 



is I dcf) (1 - / d (cos 6) 
/cose(0) / 

= is(27r-^) ' (8.22) 

where A is the area of the region of the sphere enclosed by the curve traced 
out by Q,(t). Actually, the curve traced out by S7(r) divides the surface of 
the sphere into two pieces, so there is some ambiguity in the definition of A. 
However, these two areas add up to 4tt and e Ants = 1, so we can take either 
choice of A. 

We can check that we get the correct equation of motion from this La- 
grangian. As you will show in the problem set, 

/ dz a \ dz a dz* a 

\ dr I get dr 



while 
so that 



6(1 = 5z* a a af3 z p + z* a 3 a p5zP (8.24) 



5 [ z* a ^f-) =5ti-nx^- (8.25) 
dr ) dr 



Hence, the equation of motion following from our Lagrangian is: 

is tix^ = ™ (8.26) 

dr an 

Since Q is a unit vector, Q and dQ/dr are perpendicular and the equation 
of motion can be written 

■ dn ~ dH 

is- = flx— 8.27 
dr on 

which is what we expect from dQ/dr = [fi, H\. If the spin is in a magnetic 
field, but is otherwise free, H = fi B ■ S, then the spin precesses about the 
magnetic field: 

i^ = fiftx B (8.28) 
dr 

The time-derivative term, 

S = s J dr (-icosd^J (8.29) 
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can be written in the alternative form: 

S = s J dr ^-iA^y^j (8.30) 

where A(Q) satisfies 

V fi xl(n)=sQ (8.31) 

This clearly leads to the same equation of motion and leads to the following 
interpretation of a spin: the spin can be modelled by a charged particle 
moving on the surface of a sphere with a magnetic monopole of magnetic 
charge s located at the origin. By Stokes' theorem, the action is given by 
the magnetic flux through the area enclosed by the orbit. 
It can also be written in the form 

s -W*(-'^- fix ^) (832) 

where 0(r, r) interpolates between the north pole, z and O(t): O(0, r) = z, 
0(1, r) = 0(t). 0(r, r) thereby covers the region of the sphere enclosed by 
0(t). To see that the action (8.32) gives the area of this region, observe that 
the integrand is equal to the Jacobian of the map from the (r, r) plane to the 
surface of the sphere. Hence, the action depends only on 0(1, r) = 0(r) and 
is independent of the particular interpolating function 0(r, r). To see that 
the same equation of motion results from this form of the action, imagine 
varying the upper limit of integration: 

8 " x l) (833) 

so that (50(r) = 0(r',r) — 0(1, r). Then, writing Sr = r' — 1, we have 
<50(r) = 5r{dfl/dr) and: 

SS -- .s / dr5r ( — i^- • O x 



-I 

J o 



dr dr 

dn 



s dr \-iSn nx — (8.34) 



o 



dr 



from which the correct equation of motion follows. Terms of this form are 
often called Wess-Zumino terms. 
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8.2 Ferromagnets 
8.2.1 Spin Waves 

Suppose that we have the ferromagnetic Heisenberg Hamiltonian: 



H = -J^Si • Si (8.35) 

with J > and the sum restricted to nearest neighbors. The equation of 
motion is: 



is ^i = _j s 2J-q. x q. ( 8 . 36 ) 



The ground state of the Heisenberg ferromagnet is one in which the 
SU(2) spin-rotational symmetry is spontaneously broken. The system chooses 
a direction along which to order. Let us call this direction Qq. Linearizing 
about the uniform ground state, 0, = Qq + 5Qi, we have: 



i s ■ 



dr 



= -Js 2 Y^&o x (Sfij ~ (8-37) 



Substituting a plane- wave solution, SQ = ee l ^' Ri , we have the dispersion 
relation of spin-wave theory: 



z 



E(q) = Js[z- JV* 6< (8.38) 



i=i 



where the sum is over the z nearest neighbors. For a <i-dimensional hypercu- 
bic lattice, the cordination number, z, is 2d. For a square lattice of spacing 
a in (i-dimensions, this gives: 

E(q) = 2Js cos ( 8 - 39 ) 

In the small q limit, the spin-waves have quadratic dispersion: 



E(q) = 2Jsa 2 q 2 



(8.40) 
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8.2.2 Ferromagnetic Magnons 

The small q behavior can be obtained directly from the continuum limit of 
the Lagrangian. Since Sf = s(s + 1), 

^ ■ Si = ^ - + const. (8.41) 
the action takes the following form in the continuum limit: 

S = s J d d xdr (^-i A (ft) • ^ + (vfi) ^ (8.42) 

where D = 2Jsa 2 . 

As in the previous section, we linearize the Lagrangian about an ordered 
state which, without loss of generality, we take to be O = z. We write 

SI = (m x ,m y , yjl — m x — rriy) (8.43) 

and assume that m x , m y are small so that we can neglect all terms in the 
action higher than quadratic. We can now write 

A(m x ,m y ) = - (-m y , m x , 0) (8.44) 

since 

V m x A(m x ,m y ) = s(0,0, 1) = sQ (8.45) 
Hence, we can write the action as: 

S = s J d d xdT Q ie Jt m t ^ + ^(Vm f ) 2 j (8.46) 

Introducing the fields m± = m x ± im y , we can write: 

d d xdr [2 m +~g^~ + 2- DVm +" Vm -J ( 8 - 47 ) 

In chapter 8, we learned that the propagator is simply the inverse of the 
differential operator in the quadratic part of the action. The inverse can be 
taken trivially in momentum space: 



9 1 r rjdf? p i(p-x-u>„T) 



x— uj n r) 
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Alternatively, we can expand m± in normal modes: 



™ + (^) = |^,^*« (8.49) 



and compute the propagator directly as we did for phonons. at and 
are called magnon creation and annihilation operators. Magnons are the 
quantum particles which correspond to spin waves in analogy with the cor- 
respondence between phonons and sound waves or photons and electromag- 
netic waves. In the ground state, all of the spins point up. To create a 
magnon, we flip one spin down with a^; to annihilate it, we flip the spin 
back up. 

Using the propagator (8.48), we can compute the magnetization as a 
function of temperature. To lowest order in m±, 



Q z = y/l — m + m- « 1 — -m + m_ (8.50) 



Hence, 



(n z (x,r)) = 1 - i (m+(x,T)m-(x,T)) 

i _ I ly- f dd P 



s (3^ J (2vr) d iuj n - Dp 2 
. 1 f H d d p f dw ( I I 



(27r) d J 2wi y ' \uj + id - Dp 2 uj - iS - Dp 2 



= 1 / t — r-T / duo ns(^)S(uJ — Dp 



1 r d d p r 
~~s] (2^p J 

-4/(S-^ 2 ) 

X 1 2 ^ f ( T Y /2 r*±l± (8 51) 

2s (2v) d T(d/2) \D J J e x - 1 { ' 

In the third line, we have converted the sum over Matsubara frequencies to 
an integral, as usual, obtaining a contribution only from the real axis. 

Hence in d = 3, the magnetization decreases as M z (0) — M Z {T) ~ Ta. 
In d < 2, however, the integral is divergent so we cannot trust the approx- 
imation (8.50). In fact, this divergence is a sign that the magnetization 
vanishes for any finite temperature in d < 2. Note that at T = 0, the exact 



8.2. FERROMAGNETS 



117 



ground state of the ferromagnetic Heisenberg Hamiltonian is fully polarized 
for arbitrary d. For d > 2, the magnetization decreases continuously from 
its full value as the temperature is increased. For d < 2, M z discontinuously 
jumps to zero as the temperature is raised above zero. 

Thus far, we have neglected anharmonic terms in the magnon Lagrangian. 
By including these terms, we would have interactions between the magnons. 
Magnon-magnon interactions affect, for instance, the magnetization as a 
function of temperature. We will not discuss these interactions here, but we 
will discuss the analogous interactions in the next section in the context of 
antiferromagnetism. 



8.2.3 A Ferromagnet in a Magnetic Field 

Suppose we place our ferromagnet in a magnetic field, B. At zero tempera- 
ture, the magnetization will line up along the direction of the field. Let us 
suppose that the field is in the z direction. Then the action is: 

S = s J d d xdr ^-il(ny^- + ^Z)(W) 2 + fisBn)j (8.52) 

where /i is the gyromagnetic ratio. If we expand about the ordered state, 
Q = Vi z z, then we have the quadratic action: 

f id -+ ( 1 dm_ -> -± 1 

b = s Id x dr I - m + — — + D Vm + • Vm_ — -fisB m + m^ 

(8.53) 

The propagator is now 

(T r(m+ (x,r)m-(0,0))) _ --^J^—^—^ (8.54) 

As a result of the magnetic field, there is a minumum energy cost fiB to 
flip a spin. If we repeat the calculation of the magnetization as a function 
of temperature, we find: 

(n z (x,r)) = 1 - ^ (m+(x,T)m-(x,T)) 

= 1 -l/(0 nB( ^ 2 + ^ /2) (8 ' 55) 



Unlike in the B = case, this integral is not infrared divergent: the magnetic 
field stabilizes the ferromagnetic state against thermal fluctuations. 
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8.3 Antiferromagnets 

8.3.1 The Non-Linear a-Model 

Let us now consider the antiferromagnetic case, 

H = J^Si • Si (8.56) 



with J > and the sum restricted to nearest neighbors. We expand about 
the state in which the spins are staggered - the Neel state: 

tii = (-iyn( Xi ) + - l( Xi ) (8.57) 

/ is the q = part of the spin field while n is the q=(w/a,...,ir/a) part. We 
only keep the Fourier modes near these wavevectors. n and I are assumed 
to be slowly varying and n ■ I = 0. Then Q 2 = 1 is satisfied to 0(l/s 2 ) if 
n 2 = 1. With this decomposition, we can write 

H = Js 2 ^2 \-n{ x i) ■ n(xj) + K x i) " K x j)j 

= Js 2 Y^ \{n{xi) - n{ Xj )) 2 + — (f(xi) + f(xj)) 2 ) + const.(8.58) 

going to the continuum limit, we have: 

H = va~ d J d d x (J-P + ^(Vn) 2 j (8.59) 

where v = 2Jsa. 

The corresponding action is: 

5 = .-/A*(-^((-l)M + i^.((-l)<* + i|) 

+ — V P + £^„(Vn) 2 ) (8.60) 
sa A 

Using (8.57) and 

V n x ((-l) l n xl)= (-l) m n + - I 

= (-iyn+-J+(-iy +1 d x n 
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« fi (8.61) 

we can express A in terms of n and I if we drop the gradient term in the 
penultimate line (this cannot be done in d = 1, where it is absolutely crucial, 
but can in higher dimensions). Neglecting oscillatory terms in the action, 
we have: 

S = a- d f d d xdr ( Z -fixl-^ + -vP+ S 4v(Vn) 2 
J \a dr sa 4 

The functional integral 

J VlVne~ s ^ (8.63) 

is a Gaussian integral in I, so we can perform the I integral. Integrating out 
I, we have: 




(8.64) 



Or, writing g = a d 2 /Js 2 , 




(8.65) 



This action is called the 0(3) Non-Linear a Model, or 0(3) NLuM for 
short. The 0(3) refers to the fact that the field n is a three-component field 
which transforms as a vector under the rotation group, 0(3). The model is 
non-linear because there is a non- linear constraint, n 2 = 1. 




8.3.2 Antiferromagnetic Magnons 

Let us, for simplicity work in a system of units in which v = 1. We can 
always rescale our time coordinate at the end of any calculation so as to 
restore v. Let us also employ the notation fj, = 0, 1, . . . , d. with referring 
to the time direction so that do = dt- Then we can write the action of the 
0(3) NLctM as: 

S=- f d d xdT{d ll nf (8.66) 
If, as in the ferromagnetic case, we expand about an ordered state, 



n 



(n x ,n y ,^Jl-n 2 x -nl) (8.67) 
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then we can write the action as: 



S= l -[ JiBdr ( (d,n t f + U * d f l " A " J > ) (8-68) 
9 J V 1 - n i n i J 

where z = 1,2 and n\ = n x , ni = n y . Let us rescale the fields so that 
n i ~^ y/gni- Then the action becomes: 

S = [ d^dr ( (8,n t f + g (8.69) 

In order to do perturbation theory in g, which we can hope to do when 
it is small, we divide the action into two parts, 

S fr cc = J d d xdr {d^rii) 2 (8.70) 



and 

<Sint 



= / d d x dr g ^ n ^ Ui (8.71) 
J 1 - grurii 

Note that Si nt contains all powers of g when the denominator is expanded 
in a geometric series. 

Sf ree is very similar to the phonon action; as in that case, we can expand 
rii in normal modes: 

*) = / j0T2 ir k ( a */ fcV+ ^ T + a lr tr ~~ vkT ) (8 - 72) 

a± = rii ± a,y create, respectively, up- and down-spin antiferromagnetic 
magnons. Note the difference with the ferromagnetic case, in which there 
was only one type of magnon. Since there is no net uniform magnetization in 
the ground state, we can either flip a spin up - thereby creating an up-spin 
magnon - or flip a spin down - thereby creating a down spin magnon. 

We can obtain the antiferromagnetic magnon propagator using this mode 
expansion or by inverting Sfree: 

(T r (n,(f,r)n,(0,0))) = J J^^JT^T (8-73) 

or, restoring v, 
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The corresponding spectral function is: 

B(u,p) = ^S(u-vp) - ^-S(u + vp) (8.75) 

With this propagator in hand, we can compute the staggered magneti- 
zation in the g — > limit. It is given by 

n z = \J\ - g niiii k, 1 - - g UiUi (8.76) 

Hence, 

(n z (x, r)) « 1 - g - (n^x, r)n,(x, r)) 

1^ f d d p 1 



= 1 


~ 9 2 


= 1 


-/ 


= 1 


-y 


= 1 





(2vr) d 

(^ 2^ (nB M " nB ( "^ )} 

(0^ C -(^) ( ^ 

Unlike in the ferromagnetic case, the second term does not vanish in the 
T -> limit. For T = 0, we have: 

If we approximate the Brillouin zone by a sphere, \p\ < ir/a, then we find 

(n z (x,T))f*sl-g- (-) , \, ; r (8.79) 

x v ' ;/ y d-l\a) (27r) d r(d/2) v ' 

Hence, the staggered magnetization of an antiferromagnet is less than 1 
even at T = 0, unlike the uniform magnetization in the ferromagnetic case. 
The Neel state, with neighboring spins oppositely oriented, is not the exact 
ground state. In d = 1, the integral is actually logarithmically divergent. 
This divergence hints at the impossibility of antiferromagnetic order in d = 
1, which is a consequence of a theorem which we will prove in the next 
chapter. For T finite, the integral (8.77) is logarithmically divergent at 
small p in d = 2, just as in the ferromagnetic case. Again, it is a sign of the 
impossibility of antiferromagnetic order at finite temperatures in d = 2. 
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8.3.3 Magnon-Magnon-Interactions 

Thus far, we have ignored the higher-order terms in the NLcM. These terms 
lead to interactions between magnons. To get an idea of the nature of these 
terms, let's expand S- mt to 0(g 2 ): 

S, = /***■ + 9 n t 8„n t n Anj + 9 WA», n A n }1 - m ,n, + 

(8.80) 

We need only these terms in order to do computations to order 0(g 2 ). The 
Feynman rules for this action are: 

• Assign a directed momentum and Matsubara frequency to each line. 
Assign an index i = 1, 2 to each line. For external lines, the momentum 
and frequency are directed into the diagram. 

• For each internal line with momentum, frequency q, iu> n write: 

ly^ f d d p 1 

• For each 4-leg vertex with momenta, Matsubara frequencies (p\,u ni ), ■■■ ,(p4, oo n4 ) 
directed into the vertex and indices ii, . . . ,14 associated to these in- 
coming lines, write: 

(vh^is + Pi ■ P3) (2vr) d (5(pi +P2+P3+ Vi) P ^ni +ri2 +ra3 +?i4 ,0 

• For each 6-leg vertex with momenta momenta, Matsubara frequencies 
(pi,uj ni ), . . . , (j)6,L0 n6 ) directed into the vertex and indices ii, . . . , i$ 
associated to these incoming lines, write: 

{u h U}i 3 +Pl -ps) {2TT) d 5(p 1 +P2+P3+P4+P5+P6) P 

• We assign a factor 1/5 to the diagram if there are S permutations of 
the vertices and external momenta which leave the diagram invariant. 

8.4 Spin Systems at Finite Temperatures 

In the previous two sections, we saw that ferromagnetic and antiferromag- 
netic order are suppressed by thermal fluctuations. Let us examine this 
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more closely. Let us, for the sake of concreteness, consider the case of an- 
tiferromagnetism. Let us re-write the action in the following dimensionless 
form: 



gv J Jo 



where u = vr/a and y = x/a. If we go to momentum space, 

s = °^[wf ^? G 1 "^' q) ? + \ 9)|2 ) (8 ' 82) 

then the cutoff is just ir (for a spherical Brillouin zone; more generally, it's 
some number of order 1). The Matsubara frequencies are u> n = 2irna/(3v. 
When a/ [3v » 1 - i.e. at temperatures which are large compared to v/a (the 
energy of a magnon at the cutoff wavevector) - the configurations with uj n ^ 
are strongly suppressed and give very little contribution to the functional 
integral. Therefore, the functional integral is dominated by configurations 
which are independent of r and we can replace j^dr — > (3. Hence, we may 
make the approximation: 



/ 
/ 



Pne-^WII^=MI 2 ) (883) 



We can similarly write the ferromagnetic functional integral with momentum 
cutoff of order 1 and Matsubara frequencies u> n = 2irna 2 / ' (3D: 



I 
I 



Vne- SadpI ^^ n= °' q) ^ (8.84) 



Hence, the functional integrals for the ferromagnet and antiferromagnet 
are identical at temperatures large compared to v/a or D/a 2 . Both systems 
are described by the d-dimensional NLoM. The differences between ferro- 
magnets and antiferromagnets, which have to do with their dynamics, are 
unimportant in the limit of classical statistical mechanics, which is the limit 
which we have just taken. Thus we can kill two birds with one stone by 
studying the functional integral 

Z = Jvne-^^(\(Vnf) (g>85) 
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This functional integral would be a trivial Gaussian integral if it were 
not for the constraint n 2 = 1. To impose this constraint, let's intoduce a 
Lagrange multiplier: 

Z = j VnV\e-P /^(*(Vfl) a +A(2)(n»-i)) (8 . 86) 

Now the functional integral is, indeed, a Gaussian integral in n which we 
can do: 

Z = j VXdet (V 2 + A(f))" 1/2 e -Pf dd3X W 
= j vXe-^H-^+myP f d d x\(x) ^ 87 ) 

Unfortunately, we can't do the resulting integral exactly, but we can try to 
use the saddle-point approximation. The saddle-point of the argument of 
the exponential is given by: 



_6_ 



Tr In (-V 2 + \(x)) - (5 j d d x\{x)^j = 



(8.88) 



If we look for a saddle-point solution Ao for which \{x) is independent of 
position, then this is simply 



/(0 ln ^ + A )- /3A ) =O (8 ' 89) 



d ( f d d q 
d\ 



or, 

1 (8.90) 



/ 



(27r) d q 2 + A 

If we aproximate the integral by using Ao as an infrared cutoff, then we have: 

1 Xp 1 ~ (3 (8.91) 



a d-2 



For T — ► 0, there is no spatially homogeneous saddle-point solution, Ao, if 
d > 2. For high-temperature, however, there is a solution of the form: 

V 2 ~ ^2 - P ( 8 - 92 ) 
In d = 2, there is always a solution: 

A ~ -l e -( const -)/3 ( 8 .93) 
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When the functional integral is dominated by a non-zero saddle-point 
value Ao, we can approximate it by: 

Z = j Vne'P l^(i(Vn) 2 +A„n 2 ) (8>94) 

which is a Gaussian integral. This Gaussian theory is called a linear cr-model 
This describes the high-temperature phase in which thermal fluctuations 
have disordered the magnet. From (8.94), we can see that 

(n(x)) = (8.95) 

Furthermore, using the real-space Green function which you have calculated 
in the problem set, we see that correlation functions of the magnetization 
decay exponentially with distance 

^H°))~p^ e " |l|/C ( 8 - 96 ) 

where we have defined the correlation length £ ~ \/Ao- As the temperature 
is lowered and Ao — ► 0, the correlation length grows. Finally, a transition 
takes place and the magnet orders. In the saddle-point - or mean-field 
- approximation, this occurs at Ao = 0. The saddle-point approximation 
would tell us that 

<^H°)> ~ ( 8 - 97 ) 

at the critical point, T = T c . However, as we will discuss in chapter 11 - and 
as you have investigated in the problem set - the saddle-point approximation 
is often incorrect. For temperatures below T c the magnet is ordered, and we 
can expand about the ordered state, as we did in the previous two sections. 
To summarize, there are 4 regimes for a ferro- or antiferromagnet: 

• High temperature, T > T c , where the system is described by a linear 
cr-model, 



Z = Jvne-^^i^nf+x^) 



with Ao > 0. Correlation functions fall off exponentially with correla- 
tion length £ ~ \/Aq. 



• The critical point, T = T c , at which correlation functions have power- 
law falloff. 
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• The ordered phase, < T < T c , where the magnetization (or stag- 
gered magnetization) has a non-zero expectation value. This regime 
is described by the d-dimensional NLcrM: 

Z = Jvne-Pf dds (^ 2 ) 

which can be expanded perturbatively about the ordered state. 

• The ordered state at T = which is described by the d+l-dimensional 
NLcrM in the antiferromagnetic case, 

and by the following functional integral in the ferromagnetic case. 

z = z = J v ^ e -sf d ^ d r(-iA(n).§ HD(vnf) (g98) 



8.5 Hydrodynamic Description of Magnetic Sys- 
tems 

In the limit in which magnon-magnon interactions are strong, it is hope- 
less to try to expand perturbatively about a quadratic action in either the 
ferromagnetic or antiferromagnetic cases. However, some properties of cor- 
relation functions can be deduced from hydrodynamic equations. 

A ferromagnet satisfies hydrodynamic equations very similar to those of 
a conserved particle density which we discussed in chapter 7. The magne- 
tization, Q is a conserved quantity, so the deviation from an ordered state, 
5Q = ft — Qq satisfies a conservation law: 

^-SQi + V • .P = (8.99) 
at 

and a constitutive relation: 

J 1 = -x^i^VSfli + VBi (8.100) 

(xo is the static magnetic susceptibility) from which it follows that the mag- 
netization has diffusive correlation functions: 



(8.101) 
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In the case of an antiferromagnet, however, the staggered magnetization, 
ft, is not conserved, so it does not diffuse. If the system is ordered with n = z, 
then the correct hydrodynamic equations for i = 1, 2 are: 

— (e^n,Vn fe ) = xl\T)VU (8.102) 

These equations are the equations of motion for n, I which follow from the 
NLcjM with p s = 1/g and x± = ^-/gv 2 - Rotational invariance dictates that 
the hydrodynamic equations must hold generally albeit with different values 
of p s and x±- These equations can be combined to give: 

^ (e^Vnfc) = p s (T) X ; 1 (T) V 2 (e ijfe n, Vn fe ) (8.103) 

from which it follows that there is a propagating mode of velocity \J p s {T)x~]}{T). 
A more refined analysis, which includes higher-order terms which have been 
neglected above leads to a small damping of this mode which goes as q 2 . 

8.6 Spin chains** 



8.7 Two-dimensional Heisenberg model** 
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Symmetries in Many-Body Theory 



9.1 Discrete Symmetries 

Symmetries constrain the behavior of physical systems and are, therefore, a 
useful tool in the analysis of a many-body system. In general, a more sym- 
metrical system is more highly constrained and, consequently, more easily 
solved. The most useful symmetries are continuous symmetries - ie. symme- 
tries which belong to continuous families - which we discuss in the remainder 
of this chapter. The simplest symmetries are discrete, and we focus on them 
in this section. We will focus on the archetypal discrete symmetries, parity, 
P, and time-reversal, T. 

A discrete symmetry is a transformation of the fields in a theory, <p(x, r) — > 
<p'(x,t) which leaves the action unchanged. Since the classical equations of 
motion are just the stationarity conditions on the action, a discrete sym- 
metry takes one solution of the equations of motion and transforms it into 
another. At the quantum level, we would like such a transformation to be 
effected by a unitary operator, U : 



Parity is an example of such a symmetry. We will call any transformation 
of the form 



<p(x, t)U = ip'(x, t) 



(9.1) 



if a (x,r) M ab <p b (-x,T 



) 



(9.2) 
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a parity transformation, but M a b is typically ±5 a b- Let us consider, as an 
example, our action for interacting phonons. 



S 



I 



drd A r 



(9.3) 



The parity transformation, 

m(x, t) — »■ —Ui(—x, t) (9.4) 
leaves (9.3) invariant. If we define the unitary operator Up by 

U P Ui(x,T)U P = -Ui(-x,r) (9.5) 
then U has the following effect on the creation and annihilation operators: 



v i> u k 1 ~ a -k, s 

— k,s 



Utat Up 

r k,s 



(9.6) 



Hence the vacuum state of a free phonon system, g = 0, which is defined by: 

%M = (9-7) 

is invariant under parity: 



U\0) = \0) 



(9.8) 



If we assume that the ground state evolves continuously as g is increased 
from so that the g ^ ground state is also invariant under parity, then 
parity constrains the correlation functions of the interacting theory: 

(0|T r (uj^zi.Ti) . ..u in (x n ,T n ))\ 0) = (-l) n (0\T T (u^-a^Ti) . . . u in (-x n ,T n 

(9.9) 

Note that 

Ui(x,r) -»■ -Ui(x,r) (9.10) 

is also a symmetry of (9.3), so we can take any combination of this and 
parity, such as 

Ui(x,r) — >■ Ui(-x,r) (9-11) 

It doesn't really matter what we call these various symmetries so long as 
we realize that there are two independent symmetries. Realistic phonon La- 
grangians have cubic terms which are not invariant under (9.10), so usually 
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the parity transformation (9.4) is the only symmetry. The symmetry (9.10), 
when it is present, leads to the relation 

(0 \T T {u h {x U Ti) . . . u in {x n , T n ))\ 0) = (-l) n (0 I T T (m^xun) . . . u in {x n , r„))| 0) 

(9-12) 

which implies that correlation functions of odd numbers of phonon fields 
must vanish. 

A spin, on the other hand, transforms as: 

u j P n{x,t)u P = n(-x,t) (9.13) 

The time-derivative term in the Lagrangian is not invariant under Q — > — S7, 
so there is no arbitrariness in our choice of parity transformation. 

Time-reversal is a symmetry which does not quite fit into this paradigm. 
Reversing the direction of time, t —t takes one solution of the equations 
of motion into another, but it does not necessarily leave the action S = 
J f cItL(t) invariant. Nevertheless, we might expect that there is a unitary 
operator, Ut, which transforms the phonon field Uj(i) — > Ui(—t) 

U^ 1 u i {t)U T = u t (-t) (9.14) 

In fact, this operator cannot be unitary. To see this, differentiate both sides 
of (9.14): 

U^dtUiWUx = 

Act on 

[u i (x,t),pd t u j (x',t)] 

with U^ 1 and Ut- 

U T X [ui(x,t),pd t Uj(x',t)] U T 
[ui(x, -t), -pd t Uj(x ', -t) 
- [ui(x,t),pd t Uj(x',t) 
— i 5ij5(x — x) 

which is a contradiction. 

The time-reversal operator must actually be an antiunitary operator. An 
antiunitary operator is a type of antilinear operator. While a linear operator 
O satisfies: 

O (a|V> + 0\%)) = aO\il>) + 0O\ X ) (9.18) 
an antilinear operator satisfies 



-d t Ui(-t) 
i 5ij5(x — x') 



(9.15) 
(9.16) 



Ut 1 i 5ij5(x — x') Ut 
i 5ij5(x — x') U^Ut 
i 5ij5(x — x') 
i 5ij5(x — x') 



(9.17) 



0(c#> + P\x)) = cfOW) + (3*0\ X ) 



(9.19) 
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An antiunitary operator is an antilinear operator O which satisfies 

(6x,6^) =( X ,V) (9.20) 

where we have used the notation (x, Y>) to denote the inner product between 
the states \ip) and \\)- 

The time-reversal operator, Ut, is an anti-unitary operator, which ex- 
plains how the paradox (9.17) is avoided. While the phonon field has the 
time-reversal property (9.14), a spin must transform under time-reversal so 
as to leave invariant the time-derivative term in the action: 




(9.21) 



Evidently, the correct transformation property is: 

U T 1 Q(t) U T = -n{-t) (9.22) 

Hence, the ferromagnetic, (Q(x,t)) = Qq, and antiferromagnetic, (Cl(x,t)) = 
(— l)*no, ground states are not time-reversal invariant, i.e. they spon- 
taneously break time-reversal invariance, unlike the phonon ground state 
which does not. The antiferromagnetic state also breaks the discrete sym- 
metry of translation by one lattice spacing. However, the product of T and 
a translation by one lattice spacing is unbroken. 



9.2 Noether's Theorem: Continuous Symmetries 
and Conservation Laws 

Before looking at continuous symmetries in quantum systems, let us review 
one of the basic results of classical field theory: Noether's theorem. This 
theorem relates symmetries of the action to the existence of conserved cur- 
rents. 

Suppose we have a classical field theory defined by an action and La- 
grangian density: 

S = J dtd 3 r£(0,d<j),r) (9.23) 
where (f> is the classical field. Consider a transformation 



<j>(r, t) <Xr, t, A) , 4>(f, t, 0) = <f>(r, t) 



(9.24) 
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and define the infinitesimal transformation 

Then, this transformation is a symmetry of the action if and only if 

DC = (9.26) 

for any (j) (i.e. not only for <j) satisfying the equations of motion). (Greek 
indices take the values 0,1,2,3 where = it; relativistic invariance is not 
implied.) 

Now, a general expression for DC can be obtained from the chain rule: 

dC 

DC=—D<j ) + T, lJi D{d^) 
= d^iTf, D<p + ir^D (d^(f>) 

= d^{-K^D<t>) (9.27) 

We used the equations of motion to go from the first line to the second and 
the equality of mixed partials, Ddcf) = dDcf), to go from the second to the 
third. 

Setting these two expressions equal to each other, we have 
Noether's theorem: for every transformation which is a symmetry of 
the action - i.e. DC = d^F^ - there is a current = (p,j), 

j lt = n lt D<l>-F li (9.28) 

which is conserved, 

d„j„ = d t p + V-j = (9.29) 

The extension to theories with multiple fields is straightforward and can be 
accomodated by decorating the preceding formulas with extra indices. 
As an example, let's consider space and time translations: 

Mx^^Kftxn + Xen) (9.30) 

where = (t, f) and e M is an arbitrary 4-vector. Then, 

D<j) = e a d a (f> 

DC = d a (e a C) (9.31) 
Hence, the conserved current is 

jp = e a T al3 (9.32) 
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where 

T af3 = ir a dp4> - 5 af5 C (9.33) 

This is the stress-energy tensor. Tq^ is the 4-current corresponding to time- 
translation invariance: Too is the energy density and T$i is the energy 3- 
current. T^ are the 4-currents corresponding to spatial translational invari- 
ance: To are the momentum densities and Tjj are the momentum currents. 
In our theory of an elastic medium, T a p is given by: 

2oo = H. 

Toi = pd t UjdiUj 

Tij = -2fidiU k djU k - d k u k djUi - 5ijC (9.34) 

Tij is the stress tensor of the elastic medium. 

Our action for a spin - as well as our actions for ferro- and anti-ferromagnets 
- is invariant under spin rotations, £l a — ► TZ a b£li,. In the case of an ferro- 
magnet, this leads to 3 conserved quantities corresponding to spin rotations 
about the three different axes: 

(j*,/) = (n\De ijk n j vn k ^ (9.35) 

In the antiferromagnetic case, they are: 

J li = ^jkrijd^nk (9.36) 

In general, symmetries are related to unobservable quantities. In the 
above, the conservation of momentum follows from the unobservability of 
absolute position; the conservation of energy, from the unobservability of 
absolute temporal position. Angular momentum is a consequence of the 
unobservability of absolute direction. 

9.3 Ward Identities 

In the previous section, we discussed the consequences of continuous sym- 
metries and conservations laws for classical systems. We now turn to the 
quantum theory, where the existence of continuous symmetries and their 
associated conservation laws leads to important constraints on correlation 
functions. These constraints are called Ward identities. The Ward identity 
relates the divergence of a time-ordered correlation function of a conserved 
current, j M , with some other fields, <fi to the variations of those field un- 
der the symmetry generated by j'q. The variation of tp(x,t) under such a 
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symmetry operation is: 



D<p(x, t) = 



d d x [J°(x',t),<p{x,t 



)] 



(9.37) 



To derive the Ward identities, we consider a correlation function of j a 
with the (fis: 

(Tr(j^{x,T)ip 1 (x 1 ,T 1 ) ... (p n (x n ,r n ))) = 

0(t - ti)0(ti - r 2 ) . . . 0(r n _i -r n ) M (a;,r)^i(xi,ri) ... ip n (x n ,T n )) + 
9(n -t)6(ti - r 2 ) . . . 0(r n _i - r n ) (v9i(xi,n) j M (x,r) ... ¥> n (x n ,T n )) 



If we differerentiate this with respect to x M , the derivative operator can act 
on a ^-function which has r in its argument or it can act on j^(x, r). If the 
symmetry is conserved in the classical field theory, then we can ordinarily 
conclude that d a j a {x,T) = 0. However, it is possible for this equation to be 
violated in the quantum theory when there is a cutoff (the conservation law 
can be violated when the conserved quantity flows to wavevectors beyond 
the cutoff) and this violation can remain finite even as the cutoff is taken 
to infinity. Such a symmetry is called anomalous. If the symmetry is not 
anomalous, then the right-hand-side contains only terms resulting from the 
derivative acting on the 0-function to give a 5-function: 

d fl {T T (j fi (x,T)ip 1 (x 1 ,T 1 ) ... ipn(Xn,r n ))} 

= 5(t - ti)0(ti - r 2 ) . . . 0(r n _i - T n ) m (x,t) <£>i(xi,ti) . . . (p n (x n ,T n )) 
- <5(ti - t)0(ti - r 2 ) . . . 0(r n _i - T n ) (</?i(xi,ti) ju(x,T) . . . </?„(x n ,r n )) + 

= S(t - Ti)0(ti - T 2 ) . . .0(T n _l - T n ) ( \j u (x, t) , (Xl, Tl)] . . . </?„(x n ,T n )) + . 

= <5(x - xi)<5(r - ri) (T T (Dip 1 (x 1 ,T 1 ) . . . y n {x n ,T n ))) 

+ 5(x - Xl)5{T - Tl) (T T ((^i(xi,Tl) D(f 2 (x 2l T 2 ) ■ ■ ■ ¥n{Xn,T n ))) + ... (9. 

The final equality is the Ward identity: 

da (T T (j M (x,r)v3l(xi,Tl) ... <p n {Xn,T n ))) 

= S(X - Xi)8(t - Tl) (T T (Dipi(xi,Ti) . . . ipn(Xn,T n ))) 

+ S(X - X 2 )5(t - T 2 ) (T T (v?i(xi,Ti) D(f 2 (x 2 ,T 2 ) . . . (p n (Xn,T n ))) 

+ ... (9.40) 

As an example of the Ward identity, consider an antiferromagnet, for 
which the spin currents are: 



+ ... 



(9.38) 



(9.41) 



136 



CHAPTER 9. SYMMETRIES IN MANY-BODY THEORY 





Figure 9.1: Diagrams contributing to the (a) left-hand-side and (b) right- 
hand side of the Ward identity (9.42). 

Then the Ward identity tells us that: 

dn ( T r {tij s nj{x, r)d pi n s (x, r) n k (xi,T{) ni(x 2 ,r 2 ))) 

= 5(x - xi)6(t - Ti)e ikm (T T (n m (x 1 ,T 1 )ni(x 2 ,T 2 ))) 
+ 5(x - x 2 )5(t - T 2 )e ar (T T (n l (x 1 ,T 1 ) n r (x 2 , r 2 )($.42) 

This is a non-trivial constraint when imposed order-by-order in perturba- 
tion theory, since the correlation function on the left-hand-side is given by 
diagrams such as those in figure 9.1a while the right-hand-side is given by 
diagrams such as those of 9.1b. 

As another example, consider a ferromagnet which is ordered along the 
z axis, (Q z ) = 1:. Q x generates rotations about the x-axis, so the following 
correlation function is of the form for which the Ward identity is applicable: 

(n x (iu n ,0)n y {iLo n ,0)) (9.43) 

with Jo = £l x and — > = (iu n ,0). Hence, the Ward identity tells us 
that: 

iuj n {ti x (iuj n , Q)Qy(iLu n , 0)) = (O z ) = 1 (9.44) 

or, 

(n x (ioo n , 0)n y {ico n , 0)) = — (9.45) 

We found the same result earlier for a linearized theory in which magnon- 
magnon interactions. The Ward identity shows that this result is exact, 
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i.e. the tree-level result is unchanged by the inclusion of magnon-magnon 
interactions. The divergence of this correlation function at low frequency 
is an example of Goldstone's theorem in action, as we will see in the next 
section. 



9.4 Spontaneous Symmetry-Breaking and Gold- 
stone's Theorem 

Often, the ground state is invariant under the symmetries of the Lagrangian. 
Our phonon Lagrangian, for instance, is invariant under parity and time- 
reversal, and the ground state is as well. However, this is not the only 
possibility, as we have already seen. It is possible that there is not an invariant 
ground state, but rather a multiplet of degenerate symmetry-related ground 
states, in which case we say that the symmetry is spontaneously broken. 
In terms of correlation functions, the statement of spontaneous symmetry- 
breaking is 

(4(x,t))^0 (9.46) 

where 4>(x, r) is a field which is not invariant under the symmetry, <f>(x, r) ^ 
4>(x, r) U. Such a field is called an order parameter. 
For instance, our field theory for the Ising model, 

Z = N y^ e -^M^(^) 2 +fV+iV) (9 . 47) 

is invariant under the Z2 symmetry ip — > — ip which is broken for T < T c 
(i.e. r < 0), (ip) = ±^Qr/u and unbroken for T > T c , (ip) = 0. As 
the temperature is lowered through T c , the system spontaneously chooses 
one of the two symmetry-related configurations (up) = ±y/6r/u, either as 
a result of a random fluctuation or some weak external perturbation. The 
ferromagnetic and antiferromagnetic ground states are two more examples 
of spontaneous symmetry breaking: the Heisenberg model and the field 
theories derived from it, 

S = s j d d xdT l-iA(ny^ + ^D(ytiy) (9.48) 



and 



(9.49) 
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are invariant under SU{2) spin-rotational symmetry, — > TZijVLj, rii — > 
TZijUj, but the ground states are not invariant since the magnetization or 
staggered magnetization chooses a particular direction. The signal of sponta- 
neous symmetry breakdown is the non-invariant expectation value (St) / 
or (ft) / 0. At high-temperature, T > T c , the symmetry is restored and 
($7) = or (n) = 0. These expectation values also break the discrete T 
symetry. A ferromagnetic in a magnetic field does not have an SU (2) or 
T-invariant Lagrangian, so its ferromagnetic ground state is an example of 
an explicitly broken symmetry rather than a spontaneously broken one. The 
[iB ■ Q term is called a symmetry-breaking term. The phonon Lagrangian is 
actually another example: the translational symmetry of the continuum is 
broken to the discrete translational symmetry of the lattice. At high tem- 
perature (when our continuum elastic theory is no longer valid), the lattice 
melts and the resulting fluid state has the full translational symmetry. 

In the first example, the Ising model, the broken symmetry is discrete, 
and there are no gapless excitations in the symmetry-broken phase. In 
the other two examples, magnets and ionic lattices, the broken symmetry, 
since it is continuous, leads to gapless excitations - magnons and phonons. 
This is a general feature of field theories with broken symmetries: broken 
continuous symmetries lead to gapless excitations - called Goldstone modes 
or Goldstone bosons - but broken discrete symmetries do not. 

Physically, the reason for the existence of Goldstone bosons is that by 
applying the generator of the broken symmetry, we obtain another state 
with the same energy as the ground state. For instance, if we take a magnet 
aligned along the z axis and rotate all of the spins away from the z axis then 
we obtain another state of the same energy. If the spins instead vary slowly 
in space with wavevector q, then the energy of the resulting state vanishes 
as g — > 0. These states are the Goldstone modes. 

The number of Goldstone modes is at most dimG — dimH if G is the 
symmetry group of the theory and H is subgroup of G which is left un- 
broken. If H = G, i.e. the symmetry is completely unbroken, then there 
no Goldstone bosons. In the case of an antiferromagnet, G = SU (2) and 
H = U(l) - the group of rotations about staggered magnetization axis - 
so there are dimG — dimH = 2 gapless modes. A ferromagnet in zero field 
has only one Goldstone mode while dimG — dimH = 2. A ferromagnet in a 
finite field has no Goldstone modes; G = H = U(l), the group of rotations 
about the direction of the field, so dimG — dimH = 0. A crytal has only 
three Goldstone modes, the Uj's, while G is the group of translations and 
rotations, dimG = 6, and H is a discrete subgroup, dimH = 0. 

We will now give a precise statement and proof of Goldstone's theorem. 
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Suppose we have a conserved quantity, J°, and its associated current, J 1 , so 
that 8^ J M = 0. Let ip(x, t) be some field in the theory. tp(x, t) transforms as 

D<p(x,t) = j d d x [J°(x',t),(p(x,t)] (9.50) 

under an infinitesimal symmetry operation corresponding to the conserved 
quantity J°. Then, the following theorem holds. 

Goldstone's Theorem: If there is an energy gap, A, then 

(0\D(p(k = 0,t)\0) = (9.51) 

Conversely, if (0\D(p(k = 0, t)\Q) / 0, then there must be gapless excitations. 
These gapless excitations are the Goldstone modes. 

The proof proceeds by constructing a spectral representation for (0| J°(x', t')tp(x, t)\0): 

(0\j°(x',t')<p(x,t)\0) = J dfikdwe- 1 ^-^-^-^ p{u,k) (9.52) 

where 

p(u,k) = ^<0|j°(0,0)|n)(n|v9(0,0)|0) S(u - E n ) S(k - P n ) (9.53) 

n 

By unitarity, p(uj,k) > 0. The existence of an energy gap, A, implies that 
p(ui, k) = for lo < A. Applying the conservation law to the correlation 
function of (9.52), we have: 

(0\d^(x',t')cp(x,t) \ 0) = (9.54) 

Fourier transforming and taking the k — > limit, we have: 

u (0 |j°(jfe = 0,-lo) <p(k = 0,u)\ 0> = (9.55) 

The left-hand-side can be rewritten using our spectral representation (9.52): 

uj p(k = 0, lo) = (9.56) 

which implies that 

p{k = 0, uj) = (9.57) 

or 

u = (9.58) 
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Hence, p(k = 0, uj) = for all uj > 0. However, the existence of an energy- 
gap, A implies that p(k = 0,uj) = for uj < A and, in particular, for uj = 0. 
Hence, p(k = 0, uj) = for all uj. Therefore, 

(0\j°(x',t')ip(k = 0,t)\0) = (9.59) 

Similarly, 

(0\tp(k = 0,t)J°(x',t')\0) =0 (9.60) 

and, consequently, 

<0|Dy>(fc = 0,t)|0) = (9.61) 

When the symmetry is broken, p(u>, k = 0) does not vanish when uj = 0; 
instead, there is a contribution to p(u>, k = 0) coming from the Goldstone 
modes of the form p(ui, k = 0) = cr<5(u;). Then (0 |Dp(fc = 0, t)\ 0) = cr. Note 
that this proof depended on unitarity and translational invariance. 

A field <p which satisfies 

(0\D(p(k = 0,t)\0) ^0 (9.62) 

is an order parameter, it signals the development of an ordered state. The 
order parameter of a ferromagnet is $7 while the order parameter of an 
antiferromagnet is n. It is not necessary for the order parameter of a theory 
to be the fundamental field of the theory. The order parameters of broken 
translational invariance in a crystal are: 

PG = e i3 < x ^ (9.63) 

where G is a reciprocal lattice vector of the crystal and u is the phonon field. 

If the order parameter, 93, is itself a conserved quantity, Jo = (p, which 
generates a spontaneously broken symmetry, then Dip vanishes identically 
and the associated Goldstone boson doesn't exist. This is the reason why a 
ferromagnet has only 1 Goldstone mode. If the ferromagnet is ordered along 
the z axis, then the symmetries generated by VL X and Q y are broken. If we 
look at the spectral functions for (£l x Q y ) and (£l y £ly), only the former has 
a S(uj) contribution; the latter vanishes. In the case of an antiferromagnet, 
the spectral functions for both (L x n y ) and (L y n y ), have 5(u>) contributions. 
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9.4.1 Order parameters** 

9.4.2 Conserved versus nonconserved order parameters** 

9.5 Absence of broken symmetry in low dimen- 
sions** 

9.5.1 Discrete symmetry** 

9.5.2 Continuous symmetry: the general strategy** 

9.5.3 The Mermin-Wagner-Coleman Theorem 

In chapter 9, we encountered hints that neither ferromagnets nor antiferro- 
magnets could order at finite T in d < 2 and that antiferromagnets could not 
even order at zero temperature in d = 1. Let us now discuss the difficulties 
involved in breaking a symmetry in a low dimensional system. Consider the 
simplest example, namely the Ising model. Suppose the system is ordered 
with all of the spins pointing up. What is the energy cost to create a size 
region of down spins in ti-dimensions? It is simply 

£fluct ~ L d - 1 (9.64) 

i.e. the energy cost of a domain of reversed spins is proportional to the 
surface area of the domain wall since that is the only place where unlike 
spins are neighbors. For d > 1, this grows with L, so large regions of 
reversed spins are energetically disfavored. At low temperature, this energy 
cost implies that such fluctuations occur with very low probability 

Pane ~ e -( CODSt W Ld ~ 1 (9.65) 

and hence the orderd phase is stable. In d = 1, however, the energy cost 
of a fluctuation is independent of the size of the fluctuation. It is simply 
4J. Hence, a fluctuation in which a large fraction of the system consists 
of reversed spins can occur with probability ~ exp(— 4/3 J). As a result 
of these fluctuations, it is impossible for the system to order at any finite 
temperature. This is clearly true for any discrete symmetry. 

Let us now consider a continuous symmetry at finite temperature. For 
the sake of concreteness, let us consider a d-dimensional magnet in an or- 
dered phase in which the magnetization (or staggered magnetization) is 
aligned along the z axis. (Recall that ferro- and antiferromagnets have the 
same description at finite temperature.) The energy cost for a size L d re- 
gion of reversed magnetization is less than in the case of a discrete symmetry 
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since the magnetization at the domain wall need not jump from one degener- 
ate ground state to another. Rather, the spins can interpolate continuously 
between one ground state and another. The energy cost will be the gradient 
energy, 

d d x Q(Vn) 2 ^) (9.66) 
For a fluctuation of linear size L, (Vn) 2 ~ 1/L 2 , so 

d d x (-{Vn) 2 ] ~L d - 2 (9.67) 



/, 



Hence, 



2 



P&uc ~ e -( const -)^ d - 2 (9.68) 

and we conclude that a continuous symmetry can be broken for T > in 
d > 2, but not for d < 2. This confirms our suspicion that magnets can't 
order for T > in d = 2. The dimension below which symmetry-breaking is 
impossible is called the lower critical dimension. For T > 0, the lower critical 
dimension is 2 for continuous symmetries and 1 for discrete dymmetries. 

These qualitative considerations can be made rigorous. Let us consider 
our finite-temperature order parameter, ip(x, to = 0), in d = 2. We will show 
that 

(Dip) = (9.69) 
i.e. the symmetry is necessarily unbroken for d = 2. Define: 

F(k) = J d 2 xe ik x (ip(x)ip(0)) 



F t {k) = J d 2 xe ih - x (ji(x)ip(0)) 

F u (k) = J d 2 xe ik - x (Mx)MO)) (9.70) 

where i = 0,1. The conservation law kiFi = implies that 

Fi(k) = aki5(k 2 ) + eijk jP (k) (9.71) 

This decomposition is clearly special to two dimensions. Substituting this 
decomposition into the definition of Dip, we have: 



(Dip) = a 



(9.72) 
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By unitarity, 

J d 2 xh(x) (ajo(x) + b<p(x)) |0) (9.73) 
has positive norm. From the special cases a = and b = 0, we see that 

J d 2 k\h(k)\ 2 F(k) > 
J d 2 k\h(k)\ 2 F 00 (k) > (9.74) 

positivity of the norm also implies that 



2 



J d 2 k\h{k)\ 2 F{k)^j (^j d 2 k\h{k)\ 2 F 00 (kfj > (^j d 2 k\h{k)\ 2 F (k) 

(9-75) 

If we take an h{k) which is even in x\ - and, therefore, even in k\ - then 
the right-hand-side will be: 

a J d 2 k\h(k)\ 2 k 5(k 2 ) = a J dko\h(k ,\k \)\ 2 (9.76) 

We can make the left-hand-side vanish by making |/i(fc)| 2 sharply peaked at 
very high k where F(k) and Foo(fc) must vanish. Consequently, a = and 
the symmetry is unbroken. The proof works by essentially taking the spatial 
points very far apart in the correlation functions on the left-hand-side. In 
the presence of long-range forces, the left-hand-side need not vanish, and 
spontaneous symmetry-breaking is possible. 

Thus far, our discussion has focussed on thermal fluctuations. Can quan- 
tum fluctuations prevent order at T = 0? In the case of an antiferromagnet, 
the answer is clearly yes. The quantum theory of an d-dimensional antifer- 
romagnet at T = is the same as the classical statistical theory of a magnet 
in d + 1-dimensions. Hence, we conclude that a quantum antiferromagnet 
can order at T = in d + 1 > 2, i.e. in d > 1, but not in d = 1. 

A ferromagnet, on the other hand, can order in any number of dimen- 
sions. The exact ground state of a Heisenberg ferromagnet is a state in 
which all of the spins are aligned. The reason that the above arguments 
about fluctuations do not apply to the ferromagnet is that it has a fluctu- 
ationless ground state. This can be said somewhat differently as follows. 
The order parameter for a ferromagnet, Q(q = 0,oj n = 0) is a conserved 
quantity: the components of Q(q = 0,uj n = 0) are the components of the 
total spin along the different axes. Thus, while it is true that there is very 
little energy cost for a state with reversed spins, such a state will never be 
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reached at T = since the dynamics conserves the the total spin. In the 
case of an antiferromagnet, on the other hand, n is not conserved; hence, 
the dynamics of the system can lead to fluctuations which destroy the or- 
der. At finite temperature, we must average over all of the states in the 
canonical ensemble, so the fluctuations can destroy the ordered state of the 
ferromagnet. To summarize, if the order parameter is a conserved quantity, 
then there can always be order at T = in any d. If it is not, then quantum 
fluctuations can destroy the order at T = 0. In the case of antiferomagnets 
- or phonons - this occurs in d = 1. More generally, it occurs when d+z = 2 
for a continuous symmetry or d + z = 1 for a discrete symmetry. 

9.5.4 Absence of magnetic order** 

9.5.5 Absence of crystalline order** 

9.5.6 Generalizations** 

9.5.7 Lack of order in the ground state** 

9.6 Proof of existence of order** 
9.6.1 Infrared bounds** 



Part IV 



Critical Fluctuations and 
Phase Transitions 
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CHAPTER 10 



The Renormalization Group and Effective Field Theories 



10.1 Low-Energy Effective Field Theories 

In our earlier discussions, we focussed on the low (compared to some cutoff 
A) T, low cj, q properties of the systems at which we looked. Why? The 
principal reason is that these properties are universal - i.e. independent of 
many of the details of the systems. Sometimes universal properties are the 
most striking and interesting aspect of a physical system, but not always 
(certainly not for many practical, e.g. engineering, applications). We like 
universal properties because we can understand them using effective field 
theories. 

Suppose we have a system defined by the following functional integral: 




(10.1) 



with correlation functions: 

<<Xpi) • • • 4>{Pn)) = j V<f> 4>{pi) . . . 4>{p n ) (10.2) 

The long-wavelength, universal properties of the system are determined by 
these correlation functions in the pi — > limit, i.e. in the limit that the p^s 
are smaller than any other scales in the problem. 
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Z contains a great deal of 'extraneous' information about correlation 
functions for large pi . We would like an effective action, S e g , so that 



J V(f) e ~ s ^ (10.3) 



only contains the information necessary to compute the long-wavelength 
correlation functions. The reason that this a worthwhile program to pursue 
is that Z c g is often simple or, at least, simpler than Z. On the other hand, 
this is not a comletely straightforward program because no one tells us how 
to derive Z e g . We have to use any and all tricks available to us (sometimes 
we can find a small parameter which enables us to get Z e g approximately 
and often we simply have to guess. 

At a formal level, we can make the division: 

Mp) = Hp)0(a'-\ p \) 

</>H(p) = <t>(p)0(\p\-A') (10.4) 

so that 

Hp) = Mp) + 0h(p) (10.5) 

where A' is some scale such that we're interested in \p\ < A'. Then 

Z = j Vcp L V<j) H e - s ^L^ H ] ( 10 6 ) 

The effective field theory for long-wavelength correlation functions, i.e. cor- 
relation functions of ^l, is 

Z cS = J V<p L e - s ^ L ] (ia7 ) 
= J Vcp H e-^'**] (10.8) 



where 



S e g[(pL] has cutoff A' 

Occasionally, we will be in the fortunate situation in which 

S[<f> L , <t> H ] = SM + S H \M + XS-^l, cj> H ] (10.9) 

with A small so that we can compute S e g perturbatively in A: 

s cS = sm + xsm + x 2 s 2 [4>l] + ■■■ (io.io) 

In general, we have no such luck, and we have to work much harder to derive 
Sqs. However, even without deriving 5 e ff, we can make some statements 
about it on general grounds. 
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10.2 Renormalization Group Flows 

Let's suppose that we have somehow derived S e ff with cutoff A. Let's call 
it Sa[4>]- Slip] itself may have had all kinds of structure, but this doesn't 
interest us now; we're only interested in Sa [</>]. 
We expand S\[4>] as 

SaM=J>0* (10.11) 

i 

where the g^s are 'coupling constants' and the 0« are local operators. For 
instance, our phonon Lagrangian can be written as: 

S = P 1 + fi0 2 + A0 3 + g0 4 (10.12) 

with 

01 = \ j d T d 3 x(d tUi ) 2 

02 = J drd 3 xuijUij 

04 = ^ J dTd 3 x(d k u k ) 4 (10.13) 

while the NLcM for an antiferromagnet can be written as: 

S = O + gO 1 +g 2 O 2 + ... (10.14) 



with 



= j d d xdr (d^mf 

01 = / d d x dr riid^rii njd^rij 



(10.15) 



We now pick one term in the action - call it 0f ree even though it need not be 
quadratic - and use this term to assign dimensions to the various fields in 
the theory by requiring 0f ree to be dimensionless. For instance, if we choose 
0f ree = 0i in our phonon theory, then [ui] = 1. If we choose 0f ree = 04 
then [ui] = 0. Typically, we choose the term which we believe to be most 
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'important'. If we choose the 'wrong' one (i.e. an inconvenient one) then we 
will find out in the next step. Let's call 5$ the dimension of the field <fi and 
Si the dimension of the operator 0%. <5f rec = by construction. 
We now rescale all momenta and fields by the cutoff A, 

q^qA 

4> -► <j>\ & * 

so that the momenta and fields are now dimensionless. Then 

i i 

Ordinarily, the dimensionless couplings Aj will be 0(1). On dimensional 
grounds, at energy scale u, Oi ~ (u/A) Si , and the action 

saM = x>(|)*, (mi8) 

i 

If Si > 0, this term becomes less important at low energies. Such a term is 
called irrelevant. If Si = 0, the term is called marginal; it remains constant 
as ijj — ► 0. If Si < 0, the term is relevant; it grows in importance at low 
energies. If S e s is simple, it is only because there might be a finite number 
of relevant operators. At lower and lower energies, u <C A, it becomes a 
better and better approximation to simply drop the irrelevant operators. 

Let's formalize this by putting together the notion of a low-energy effec- 
tive field theory with the above considerations about scaling: 

• We have an effective action, S\[(p] and a choice of Of rcc . 

• We divide 

<l>{v) = Hip) 0(bA' -\ P \) + Mp) 9{\p\ - 6A') 
where b < 1. 

• The next step is to obtain (by hook or by crook) S^a 

e -S bA [<p L ] _ J ■jy ( p H e SA[4>L,<t>H] 

• We now rescale 



(10.16) 



q^ qb 
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uj — »■ uib z 

where Q and z are chosen to preserve Of rcc . In general, £ and z will 
depend on the couplings £ = ((gi), z = z(gi). In equilibrium 
classical statistical mechanics, there are no frequencies, so we do not 
need to worry about g; in the theories which have examined thus far 
uj and q are on the same footing, so z is fixed to z = 1. In general, 
however, one must allow for arbitrary z. The rescaling yields S\[4>] 
which also has cutoff A. 

• The physics of the system can be obtained from S\[(f>\ by a rescaling. 
For instance, the correlation length is given by 

(0s A = \ (0s» 

• If 

S A [4>]=J29iOi 

i 

then 

S h M = Y J 9i{b)O i 

i 

where gi(l) = gf. Let b = e~ l . Then we can define flow equations: 

dg r 

which describe the evolution of S\ under an infinitesimal transforma- 
tion. These equations are called Renormalization Group (RG) equa- 
tions or flow equations. 

If we can neglect the . . ., then for Si < 0, grows as £ increases, i.e. 
Oi is more important at low energies (£ — ► oo. For <5, > 0, g^ decreases, i.e. 
Oi is less important at low energies. Of course, the . . . need not be small. 
In fact, it can dominate the first term. In the case of a marginal operator, 
Si = 0, the ... is the whole story. For example, if 

Then 

so g(£) grows as £ grows. 
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10.3 Fixed Points 

If, for some values of the couplings, gi = g* , 

dg 



= (10.21) 

9k=g* k 

then we call gu = a fixed point. At a fixed point, Sa = S A ', so the physics 
is the same at all scales. Hence, 

Z=\z (10.22) 

i.e. £ = oo - the low-energy physics is gapless - or £ = - there is no 
low-energy physics. 

The notion of universality is encapsulated by the observation that differ- 
ent physical systems with very different 'microscopic' actions 5a, S' a , S a can 
all flow into the same fixed point action, S^. If this happens, these systems 
have the same asymptotic long-wavelength physics, i.e. the same universal 
behavior. 

At a fixed point, we can linearize the RG equations: 

±(g i -g*)=A ij {g j -g* j ) (10.23) 
This can be diagonalized to give: 

^jr = w«i ( 10 - 24 ) 

where U{ = Oij(gj — g*). The corresponding operators, Oi = OijOij, 

S A [cj ) }=Y,u l O l (10.25) 

i 

are called eigenoperators. If yi > 0, we say that Ui and Oi are relevant at 
this fixed point. If yi = 0, we say that Ui is marginal. If yi < 0, Ui is 
irrelevant at this fixed point. 

Earlier, we characterized Oi as relevant, marginal, or irrelevant according 
to whether Si < 0, Si = 0, or Si > 0. What this really means is that Oi 
has this property at the fixed point S* = Of TCC . It is possible for a coupling 
constant, g, to be relevant at one fixed point, S^, but irrelevant at another 
fixed point, 5^, as shown in figure 10.1. 
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Figure 10.1: The coupling g\ is relevant at the fixed point on the left but 
irrelevant at the fixed point on the right. 



10.4 Phases of Matter and Critical Phenomena 

If yi < for all i at a given fixed point, then we call this fixed point an 
attractive or stable fixed point. Theories will generically flow into such a 
fixed point. Stable fixed points represent phases of matter. In this course, 
we have already looked at a number of stable fixed points. Our phonon 
theory, 

S = J dtd 3 x£ = \ j dtd 3 f p(dtUi) 2 - - \u 2 kk (10.26) 

is a stable fixed point (you can check that g is an irrelevant coupling) at 
T = for d > 1. This stable fixed point corresponds to a stable phase 
of matter: a crystal. For T > 0, it is stable for d > 2. Our theories of 
non-interacting magnons 

//I Q m 1 _ _, \ 

d d xdr ( 2 m +~Q^ + 2 DVm +' Vm -) (10.27) 

S = j d d xdT{d^nif (10.28) 

are also stable fixed points corresponding, repectively, to ferromagnetic and 
antiferromagnetic phases. The ferromagnet is always stable at T = 0, while 
the antiferromagnet is stable at T = for d > 1. For T > 0, both are stable 
for d > 2. Similarly, XY magnets and superfluid 4 He 

S= j d d xdr {d^ef (10.29) 



154 



CHAPTER 10. THE RENORMALIZATION GROUP AND 

EFFECTIVE FIELD THEORIES 



are phases of matter at T = for d > 1. For T > 0, they are stable for 
d > 2 (d = 2 is a special case). 

The stable phases described above are all characterized by gapless modes 
- i.e. £ = oo which are a consequence of spontaneous symmetry breaking. 
There are also stable phases without gapless modes - i.e. with £ = 0. 
The 4 He action with /j, < fi c (in the saddle-point approximation, [i c = 0) 
describes an empty system with a gap fj, c — [i to all excitations. 

S = j drd d x (V^ + lWf + y(|V| 2 -^) 2 ) (10-30) 

Similarly, (p 4 theory 

J d d x Qk(V^) 2 + \ V + i V) (10.31) 

has two stable phases - ordered and disordered phases - corresponding to 
the fixed points r — > ±oo. At both of these fixed points, £ = 0. Similarly, 
the high-temperature disordered states of magnets are stable phases with 
gaps. 

It makes sense to do perturbation theory in an irrelevant coupling be- 
cause this perturbation theory gets better at low q, uj. Essentially, the ex- 
pansion parameter for perturbation theory is the dimensionless combination 
goj~ Va for some correlation function at characteristic frequency uj. Hence, 
our perturbative calculations of correlation functions in the phonon and 
magnon theories were sensible calculations. Similarly, perturbation theory 
in the coupling u in (p 4 is sensible for d > 4 (the Ginzburg criterion) as you 
showed in the problem set. However, it does not make sense to perturb in 
a relevant coupling such as u in ip A for d < 4. In such a case, the effec- 
tive expansion parameter grows at low q,uj. The low q,uj physics is, in fact 
controlled by some other fixed point. 

If some of the yi > then the fixed point is repulsive or unstable. The 
relevant couplings must be tuned to zero in order for the theory to flow 
into an unstable fixed point. Unstable fixed points represent (multi-)critical 
points separating these phases. The unstable directions correspond to the 
parameters which must be tuned to reach the critical point. Superfluid 4 He 

S = j drd d x (V ^ + |Wf + U(jVf - ^) 2 ) (10.32) 

has a critical point at [i = /U c . The corresponding fixed point is at ji = fx*, 
V = V*. This critical point separates two stable phases: the superfluid 
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Figure 10.2: The flow diagram of a critical point C and two stable fixed 
points Si,S2 which it separates. 

and the empty system. There is one relevant direction at this fixed point. 
By tuning this relevant direction, we can pass from one phase through the 
critical point to the other phase. Similarly, the Ising model has a fixed point 
with one relevant direction which we discuss in a later section. By tuning the 
relevant coupling, we can pass from the ordered state through the critical 
point and into the disordered state. Figure 10.2 depicts the flow diagram 
for a critical point and two stable fixed points which it separates. 

10.5 Infinite number of degrees of freedom and the 
nonanalyticity of the free energy** 

10.5.1 Yang-Lee theory** 

10.6 Scaling Equations 

Let us consider the the implications of this framework for physical quantities. 
Suppose C{pi,g.j) is some physical quantity such as a correlation function 
of n fields (p. It will, in general, depend on some momenta pi and on the 
coupling constants of the system. Suppose that the couplings are all close 
to their fixed point values, g± rs g*, so we will write C as C(pi,gi — g*) and 
suppose that the linearized flow equations read: 

^(9i-g*) = \ (9,-9*) (10.33) 

Then we can perform an RG transformation, according to which: 

C (pi, gi - g*) = b~<* C (|, ( 9i - g*) (10.34) 

Suppose that we are in the vicinity of a stable fixed point, so that all of 
the Xi < 0. Then, in the b — »■ limit 



C(p i ,g i -g*)=b-<*c(^,o) 



(10.35) 
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If, for instance, we are interested in the two-point correlation function at 
low p, we can take b = p and: 

C(p, gi -g*)^p- 2 t*C (1,0) (10.36) 

A similar result follows if we are in the vicinity of an unstable fixed point, 
but we have set all of the relevant couplings equal to zero. This scaling 
relation may seem to contradict simple dimensional analysis, which would 
predict C (p, gi — g*) ~ p~ 2S <t> . In fact, there is no contradiction. The missing 
powers of p are made up by the dependence on the cutoff: 

C(p, gi -g*)^p-^A-^C (10.37) 

These observations can be reformulated as follows. Consider a corre- 
lation function C n (pi,gi) of n ip fields. Up to a rescaling, this correlation 
function is equal to its value after an RG transformation: 

C n ( Pi ,gi) = (V)" C(9l) C n ( Pi e e , gi (e)) (10.38) 

The left-hand-side is independent of £, so differentiating both sides with 
respect to t yields the RG equation: 

^ + n (s v - \r) (#)) + Pi (9j) ^) C n ( Pi e e , 9i {i)) = (10.39) 
where 

26 V - V ( 9j ) = f e (ef^ (10.40) 

6 V is the naive scaling dimension of ip. rj is called the anomalous dimension 
of ip. The (3 functions are the right-hand-sides of the flow-equations for the 
couplings: 

Ate) = ^r (10 - 41) 

At a fixed point, the /^-functions vanish, = and rj is a constant, 

28 v -r) = 2((g*) (10.42) 

so the RG equation reads: 

§- £ ~ n (Sy - ± v yj C n ( Pl e- e ) = (10.43) 

In other words, the correlation function is a power-law in pi with exponent 
n5 v — nrj/2. 
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Suppose, instead, that we are near a fixed point with one relevant direc- 
tion. Call this coupling u and the other irrelevant couplings <?j. Then, 

C ( Pl , U - U *,g t -g*)= b-<' C (pi/b, (U - U*) - 9 *) b^) 

(10.44) 

If u — u* is small, then we can take b = (u — u*) l / Xu and be in the 6^0 
limit: 

C ( Pi ,u - u\ 9i - g*) - 1 — F7r C ( Pl (u - u*)- 1/K A, 0) (10.45) 

or 

C( Pi ,u-u*, gi -g*) -+ L^ F ( Pi (u-uT 1/K ) (10.46) 

where F(x) is called a scaling function. If the stable phase to which the 
system flows is trivial - i.e. has a gap - then C (pi(u — u*) -1 / Au , 1, 0) can 
be calculated and does not have any interesting structure. The non-trivial 
physics is entirely in the prefactor. 

If we are interested in a correlation function at pi = such as the mag- 
netization of a ferromagnet, then we can write: 

Co (u - u*, 9l - g*) -+ 1 — F7r F (( 9i - g*) (u - u*r V M (10.47) 

Now imagine that there is a second relevant coupling, g, or, even that g 
is the leading irrelevant coupling (i.e. the least irrelevant of the irrelevant 
couplings) so that we are interested in the g dependence of the correlation 
function. Then, setting the other couplings to their fixed point values in the 
small u — u* limit: 

C (u-u*, 9i - g*) - L_ F ( 9i ~ 9 [. /x ) (10.48) 

( n _ u *) n( ./ A « \{u — u*) A i/ A «y 

10.7 Analyticity of (3- functions** 

10.8 Finite-Size Scaling 

Temperature plays a very different role in classical and quantum statisti- 
cal mechanics. In the classical theory, temperature is one of the couplings 
in the theory. The temperature dependence of physical quantities can be 
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determined from the scaling behavior of the temperature. Classical statisti- 
cal mechanics can be used to calculate a correlation function at wavevector 
the temperature is larger than the important excitation energies since the 
n^O Matsubara frequencies can then be ignored. In the quantum theory, 
temperature is the size of the system in the imaginary time direction, and 
the temperature dependence of physical quantities can be determined from 
finite-size scaling which we discuss below. Finite-size scaling can be used in 
the limit in which j3 is large. An alternative, related way of dealing with 
finite-temperature is dicussed in the context of the NLtrM in the last section 
of this chapter. 

Finite-size scaling is also useful for dealing with systems which are finite 
in one or more spatial directions. Since numerical calculations must be 
done in such systems, finite-size scaling allows us to compare numerics to 
analytical calculations which are more easily done in the infinite-size limit. 

Since renormalization group equations describe the evolution of effective 
Lagrangians as one integrates out short-distance physics, it is clear that 
these equations are insensitive to finite-size effects so long as the finite- 
size is much larger than the inverse of the cutoff. While the equations 
themselves are unchanged, the solutions are modified because they depend 
on an additional dimensionful parameter, namely the size of the system (in 
our case, (3). For simplicity, let us consider a theory with a single relevant 
coupling (say, 4> 4 theory), which satisfies a renormalization group equation 
with a low-energy fixed point: 

{~k + M Wg + n {** " l V i9) ) ) G(n)fee '' 9 ^ Le ~^ = ° (1 °- 49) 

G {n) 

is an n-point Green function, L is the finite size of the system. We may 
take e — L, and we find 

G^ n \ Pi ,g,L) = L n ^-^^)G {n \ Pi L,g{\nL),l) (10.50) 

Then in the large-size limit, L — > oo, we have g(lnL) — > g* . As a result, we 
have the scaling form: 

GW(pi,g,L) = L«(^4ife))GW( K L l5 *,l) (10.51) 

We will be primarily concerned with the case in which the finite size, L, 
will be the inverse temperature, 0, so (10.51) will give the temperature 
dependence of Green functions in the low-temperature limit. 



G^(pi,g,/3) ~^(^4i) G W W)ffV) ( io.52) 
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J/T=0 J/T = 00 

Figure 10.3: The flow diagram of the ID Ising model. 

10.9 Non-Perturbative RG for the ID Ising Model 

In the next two sections, we will look at two examples of RG transformations, 
one non-perturbative and one perturbative. First, we look at the ID Ising 
model, 



Our RG transformation will be done in 'real space' by integrating out the 
spins on the even sites. This procedure is called decimation. Whereas the 
original model has wavevectors —it /a < k < ir/a, the resulting theory has 
—it /2a < k < it /2a. We can then rescale momenta by 2 to obtain an RG 
tranformation. 

0i=±l i 

= yi n ( 2 c ° sh t ^ 2i+i + CT2 ( i+i ) +i ) ) 

<T2+i=±l i ^ 

= n^ e ^ ,<T2i+1<T2(l+1)+1 (10.54) 

iT2+l=±l i 



where K = 2e(i') and 



£\ = \ In cosh 2^ (10.55) 



This RG transformation has only 2 fixed points, y = and y = oo. y is 
relevant at the ^ = oo fixed point but irrelevant at the ^ = fixed point. 
The flow diagram is shown in 10.3. This flow diagram shows that for any 
T > 0, the system is controlled by the disordered y = fixed point. Only 
at T = can the system be ordered. 
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10.10 Dimensional crossover in coupled Ising chains** 

10.11 Real-space RG** 

10.12 Perturbative RG for </ Theory in 4 - e Di- 
mensions 

Our second example is (p theory. 



/ 



d d q 1 2, , m2 , f d d q 1 2 



(2ir) d 2 * J (2ir) d 2 

u f dSi d d q 2 d d g 3 



We take the first term as Of ree - Under a rescaling q — > (76, we must take 

93^99 6"^ (10.57) 

Using this rescaling, we immediately see that the leading terms in the RG 
equations are: 

dr 

— = 2r + ... 

di 

nil 

_ = (4-d)u + ... (10.58) 

We immediately find one fixed point, the Gaussian fixed point: r = u = 
0. At this fixed point, r is always relevant while u is irrelevant for d > 4 
and relevant for d < 4. You will recognize that this is the same as the 
Ginzburg criterion which determines when the saddle-point approximation 
is valid for this theory: the saddle-point approximation is valid when the 
quartic interaction is irrelevant. When the quartic interaction is irrelevant, 
the correct theory of the critical point is simply. 

s= S (0^ 2| ^ )|2 (io - 59) 

which has critical correlation functions 

(^>(°))~up ( 10 - 6 °) 



The one relevant direction at the Gaussian fixed point in d > 4 is the 
temperature, r. At the Gaussian fixed point, r has scaling dimension 2. 
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Hence, £ ~ V\A"- As we discussed in the context of the Ising model, r ~ 
T-T c . Hence, 

« ~ iFI^75 < 10 - 61 > 

Of course, we should also allow (p & , 92 s , etc. terms. If we don't include 
them initially in our action, they will be generated by the RG transforma- 
tion. However, the 93 s operator is only relevant below 3 dimensions, the (p s 
operator is only relevant below 8/3 dimensions, etc. Hence, for d > 3, we 
can ignore the higher order terms in the asymptotic q — > limit because 
they are irrelevant. (Actually, we have only shown that they are irrelevant 
at the Gaussian fixed point; in fact, they are also irrelevant at the fixed 
point which we find below.) 

For d < 4, the Gaussian fixed point has two unstable directions, r and u. 
We can compute the RG equations to the one-loop level to find other fixed 
points. We make the division of cp into cpi, and cpjj and integrate out cpjj at 
the one-loop level. At a schematic level, this works as follows: 



where 



q r 1 f bA d d q 1 2 2 f bA d d q 1 2 

u f bA d d gi d d q 2 d d q 3 
+ 4! J (2^(2^(2^^ {<ll) VL fe) VL te) VL 

s ^ = Lwf \ q2 ^ 2 + Lwf \ r ^ 2 ^ 

a r 1 u f d d qi d d q 2 d d q 3 

Sint[<PL, <PH\ = -fi J J^y J^y J^y '■fH ¥>ff (ffi) <fH (ffi) M 

+ V'^^WV d(fH {qi) VH {q2) VL iq3) Hm5) 

S'int also contains ^Ph^h^h^l and ^plPlPl^h terms, but the phase space 
for these terms is very small since it is difficult for three large momenta to 
add up to a small momentum or the reverse. Hence, we can safely ignore 
these terms. Expanding perturbatively, there is a contribution of the form: 

e -s> A [ VL ] = e -s L [<p L ] j VipHe -s° H [f H ] (1 _ 
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(a) 




(b) 





Figure 10.4: The one-loop diagrams which determine the RG equations for 
(a) r and (b) u. 



u 
4 



f d d qi d d q 2 d a q 3 

J VH {qi) v * {q2) ^ fe) ^ {qi) 



+ 0(u 2 ) (10.66) 



We can do this diagrammatically by computing one-loop diagrams with 
internal momenta restricted to the range 6A < \q\ < A. The external legs 
must be (pi fields, i.e. must have momenta q < bA. (Note that the contribu- 
tion to S;,a is the negative of the value of the diagram since we are absorbing 
it into e^A^il.) The contribution described above results from the first- 
order diagram with two external legs. Such diagrams give a contribution to 
SbA of the form: 

/ Wf c{q) ^ L{q)] ? (10 - 67) 

where c(q) = Co + c 2 q 2 + . . .. Diagrams with four external legs give a contri- 
bution to SbA of the form: 

1 f d d q x d d q 2 d d q 3 

4! J (2^p (2^p (2^p V {qi,q2 ' Q3) * {qi) * fe) * fe) ^ { ~ Ql ~ q2 ~ Q3) 

(10.68) 

The one-loop contribution to the RG equations is given by the diagrams 
of figure 10.4. The diagram of 10.4(a) gives the contribution of (10.66), 
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namely 

1 f l d d q 1 f d d p 1, , 

Dropping higher-order terms in r (because we are interested in the vicinity 
of the Gaussian fixed point, r = u = 0), we can rewrite the integral as 

1 d d q 1 f 1 d d q ]__ r f 1 d d q 



(2vr) d g 2 +r J b {2n) d q 2 J b (2n) d q 2 (q 2 + r) 
[ 1 J*q_±_ ? d d q 1 2 
"A (2tt)<V r J 6 (27r) ^4 + t A r 



27T2 1 



(27r)^r (I) d-2 

d 

2tt2 1 
■ r 



1 - b d ~ 2 ) 

1 - 6 d " 4 ) (10.70) 



(27r)T(g)'d-4 

Meanwhile, the three diagrams of figure 10.4(b) each give a contribution 
1 2 f 1 d d q 1 /" d d g 2 A3 1 



"2 U X J^WTry 1 J^J^ d (2^^ v{qi)ip{q2)v{q3)v{ - qi - q2 - q3) 

(10.71) 

or, adding them together and evaluating the integral in the r = limit, 



3 2 1 1 fi ,d-i\ f Ai d d q 2 d d q 3 1 

2" (2 7 r)lr(|) J 7 (^F (2^(2^4!^^ ^) ^3) ^ M 

(10.72) 



Observe that there is no one-loop contribution to the 



/ 



(2ir) d 

term. Hence the correct rescaling is still 



t/,;,/ <rV(?)| 2 (10.73) 



^^cpb-^ (10.74) 



As a result, we find the one-loop RG equations: 

d 

r + dr = b- 2 (r+-u 2 J* — ( 1 - ft d ~ 2> ) 



2 (27r)T(g)d-2 

1 2vri 1 , 
- — ur 7-7- l — o 

2 {2Tt) d T {±) d - A 
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\ 



Y* — 



U' ! '= 



-8/6 , 

16tt 2 £/3 



r*= u*= 



Figure 10.5: The flow diagram of a theory in 4 — e dimensions. 

d 

u + du = b d ~\u --u 2 27T \^ -J— f 1 - 6 d " 4N ) ) (10.75) 

Writing b = e~ M , and taking the limit of small e = 4 — d we have: 

dr 1 1 

^ = 2r+ 16^-16^ Ur + --- 

3 , . . 

«= e "-16^" + "' <la76) 
The corresponding flow diagram is shown in figure 10.5. These RG equations 
have a fixed point at 0(e): 

1 

r = --e 

»* = ^e (10.77) 



At this fixed point, the eigenoperators arc: 

£ (r -0=(2-i t )(r-0 

^(n-u*) = -e (n-u*) (10.78) 

There is only one relevant direction (corresponding to the temperature) with 
scaling dimension 1/v = 2 — |e. The correlation length scales as: 

f-p^rp (10.79) 
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At the critical point, the correlation function has the power law decay 

(^>(0))~^ (10.80) 

At order e, rj = 0, as we have seen. However, this is an artifact of the 
0(e) calculation. At the next order, we find a non- vanishing contribution: 
rj = e 2 /54. 

For e small, our neglect of higher-loop contributions is justified. To 
compute in d = 3, however, we must go to higher loops or, equivalently, 
higher-order in e. 

Several remarks are in order: 

• As we showed in chapter 8, the Ising model can be mapped onto a tp A 
theory if higher powers of ip are neglected. We can now justify their 
neglect: these terms are irrelevant. 

• There are many different ways of implementing the cutoff, i.e. regular- 
izing a theory: by putting the theory on a lattice (as in the ID Ising 
model above), by introducing a hard cutoff (as in 92 4 theory above), 
or by introducing a soft cutoff - e.g. by multiplying all momentum 

2 / A 2 

integrals by e~ q ' - just to name a few. 

• Corresponding to these different regularization schemes, there are dif- 
ferent renormalization group transformations, such as real-space deci- 
mation on the lattice or momentum shell-integration for a momentum- 
space cutoff. Since all of these different cutoff theories will flow, ulti- 
mately, into the same fixed point under the different RG transforma- 
tions, it is a matter of convenience which scheme we choose. 

• Both RG equations and the fixed point values of the couplings are 
scheme dependent. The universal properties, such as exponents, are 
scheme independent, e.g. in the RG equation 

% = X g (9-9*) + ... (10.81) 

X g is scheme independent and independent of all microscopic details, 
but g* is scheme-dependent, and can depend on microscopic details 
such as the cutoff. 

• Integrals which are logarithmically divergent at large q are propor- 
tional to In b and are independent of the cutoff A. Consequently they 
are scheme independent. Integrals which are more strongly ultra-violet 
divergent are scheme-dependent. 
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• The term in the RG equation for r which is independent of r determines 
only the fixed point value of r, i.e. r*. It does not affect the scaling 
exponents. In fact, it is renormalization scheme dependent; in some 
schemes, it vanishes, so it can be dropped and we can work with: 

dr n 1 

du 3 9 

M =e "-T^ tt+ - < la82 > 



10.13 The 0(3) NL<tM 

In order to study the phase diagram of a quantum system we would like to 
consider both T = and finite T in the same phase diagram. In particular, 
we would like to consider even the high temperatures at which classical phase 
transitions can occur. Finite-size scaling - which is useful for asymptotically 
low temperatures - is not appropriate for such an analysis. Instead, we 
carry out the renormalization group transformation directly on the finite- 
temperature quantum-mechanical functional integral. We will show how 
this is done for an antiferromagnet: 

J Vne~^ dd£ ^ dT{d ^ )2 (10.83) 

The requirement of 0(3) symmetry together with the constraint n 2 = 1 
implies that we can add to this action irrelevant terms such as 

J d d xdr J^dr (d^n ■ d^nf (10.84) 

but no relevant terms for d > 1. 

As usual, we rewrite the action as: 

S = f d^ fdr ( [d^f + g (10.85) 
7 Jo V 1 - 9 n i n i ) 

We now define an RG transformation in which we integrate out ni(q,co n ) 
with wavevectors e~ e A < \q\ < A but arbitrary Matsubara frequency oo n . 
This is different from the RG which we defined earlier, but it is still per- 
fectly well-defined. In the evaluation of diagrams, the internal momenta are 
restricted to the shell e~ l Aq < A, but the Matsubara frequencies can run 
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t 




Figure 10.6: The flow diagram of an antiferromagnet in d > 2. 

from n = — oo to n = oo. However, in the rescaling step, we rescale both 
momenta and frequencies: 

q^qe 

co n -► uj n e~ e (10.86) 
The second equation means that the temperature is rescaled: 

P -»• /3e e (10.87) 

n« must also be rescaled: 

m -► nie -£C (10.88) 
In the problem set, you will compute the one- loop RG equation for g. It is: 

d A = (i _ d) g + I ( ^ <? 2 coth f (10.89) 

^ v ;y 2 \ v (2^) d r(f) y y 2 v 7 

/3 changes trivially since it is only affected by the rescaling. 

I—/, (10.90) 

Hence, if we define the parameter t = g/(3, we can write a scaling equation 
for t: 

de\(3J 0d£ 9 de\p 
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t 




Figure 10.7: The flow diagram of an antiferromagnet in d = 2. 

= 4 I (1 - d) g + 2 f 2 . g 2 coth - ] + ^ 

\ 2(2vr) d r(f) y 2 y 

d 

= (2-d)?- + - 2 *\ .. coth^ (10.91) 

V 7 2 (2vr) d r (| ) y /? 2 v ; 

In other words, we can rewrite the RG equation for g and the trivial rescaling 
for as the two equations: 

% = il-d) g + \{ 2 f (d \ g 2 cothA 
di V ,y 2 ^(27r) d r(|) y y 24 

^ = (2-d)t + -[ 2 J' I fftcoth-^- (10.92) 

di V ' 2 ^(27r) d r(f) y ^ 2/ V ; 

At zero temperature, t = 0, the first equation shows that there is a stable 
fixed point at g* = for <i > 1. This is the antiferromagnetically ordered 
phase. For t, g small, the system flows into the g* = t* = fixed point. We 
will discuss the basin of attraction of this fixed point below. 

There is an unstable fixed point at 

* - C- 1 ^™ (10,3) 

7T2 



For g > g c , g flows to g = oo. At this fixed point, the antiferromagnet is 
disordered by quantum fluctuations. Such fixed points are called quantum 



10.13. THE 0(3) NLo-M 



169 



critical points, g can be varied by introducing a next-neighbor coupling J' 
which frustrates the nearest-neighbor coupling J. Increasing J' increases g. 
At finite temperature, t > 0, there is a fixed point for d > 2 at, 

g*=0 

t* = t c = '-^-jr ^ (10.94) 

2tt2 

For d < 2, there is no fixed point at finite temperature; all flows go to t = oo. 
The flow diagrams are shown in figures 10.6 and 10.7. 

The region underneath the dark line in figure 10.6 is the antiferromag- 
netically ordered phase controlled by the g* = 0, t* = fixed point. For 
given g - i.e. for a given system - there is a range of t for which the system 
is antiferromagnetically ordered. This range of t translates into a range of 
temperatures, < T < T c . For g — ► 0, T c — > oo. 

At both the zero and finite-temperature fixed points, the correlation 
functions exhibit power-law decay. As these fixed points are approached, the 
correlation length diverges. In the zero-temperature case, the divergence is: 

£ ~ \g - 9cV Vd+1 (10.95) 

while, at finite temperture, it is: 

i~\t-t c \~ Ud (10.96) 

In the problem set, you will calculate v&. 

At the finite-temperature critical point, the correlation functions have 
power-law decay: 

faffing)) --^L^Sij (10.97) 
while at the zero-temperature critical point, they decay as: 

(n,(x, r) n,(0)) ~ ^ + T ^ l+m+l)/2 ^ (10-98) 

In the problem set, you will calculate 

To summarize, in d > 2, an antiferromagnet described by the 0(3) 
NLuM exhibits the following physics: 

• An antiferromagnetic phase controlled by the g* = t* = fixed point. 
This phase is characterized by an 0(3) symmetry which is sponta- 
neously broken to U(l), 

(ft) / (10.99) 

yielding two Goldstone modes. 
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• A zero-temperature quantum critical point g* = g c , t* = character- 
ized by power-law correlation functions: 

(n i (x,T)n i (0)) ^— —5a (10.100) 

\ H > 2 + r 2)(d-l+»/d+i)/2 lJ 1 ; 

• A zero-temperature paramagnetic phase controlled by a fixed point at 
g* = oo, t* = and characterized by exponentially-decaying correla- 
tion functions: 

e-M/C 

(n 4 (x,r)n,(0)) ~ — <% (10.101) 

(a; + r ) 

As 5 C is approached at t = 0, the correlation length diverges as: 

£~b-2 c r" d+1 (10.102) 

• A finite-temperature critical point at t* = t c , g* = characterized by 
power-law correlation functions: 

(n^n.m^-^-L-^ (10.103) 

Near 4 dimensions, this critical point can can be studied with an 0(3) 
((p a (p a ) 2 theory using a an e = 4 - d expansion. 

• A finite-temperature paramagnetic phase controlled by a fixed point 
at t* = oo, g* = and characterized by exponentially-decaying corre- 
lation functions: 

e -M/€ 

<n i (£)n J -(0))~-^ =r (5 ij - (10.104) 
As t — > i c , the correlation length diverges as; 

i~\t-t c \~ Vd (10.105) 

In d = 2, we have an antiferromagnetic phase only at T = for < g < 
g c . The system is paramagnetic in the rest of the phase diagram. 

The calculations of this section and the problem set are all to lowest 
order in al — 1 at zero-temperature and d — 2 at finite temperature. In the 
next section, we will generalize the 0(3) NLcM to the O(N) NLerM and 
derive the RG equations to lowest order in 1/N. 
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(a) 







Figure 10.8: (a) 0(1) diagrams and (b) 0(1/ N) diagrams with two external 
legs. 



where a = 1, 2, . . . , N. For N = 1, this theory has the Z 2 symmetry of the 
Ising model. For N = 2, the theory has the 0(2) = U(l) symmetry of 4 He 
or an XY magnet. For arbitrary N, the action has O(N) symmetry. The 
RG equations simplify for N — > oo as we now show. 

Let's classify diagrams according to powers of N. Each vertex gets a 
factor of 1/N. Every time we sum over an index a, we get a factor of N. 
First, let's consider the diagrams with two external legs. These diagrams 
renormalize r (and, possibly, £)• Figure 10.8a contains some O(l) two-leg 
diagrams. 

Let's now turn to the diagrams with 4 external legs. Figure 10.9a con- 
tains some 0(1/ N) diagrams with 4 external legs. Other diagrams, such as 
that of figure 10.9b are down by powers of 1/N. 

To organize the diagrams in powers of 1/N, it is useful to perform a 
Hubbard- Stratonovich transformation. We introduce a field a and modify 
the action by adding a term: 



10.14 Large N 



Suppose we generalize ip 4 theory to 




(10.106) 




(10.107) 
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Figure 10.9: (a) Some 0(1 /N) diagrams and (b) an 0(1/ N 2 ) diagram with 
four external legs. 



Since the action is quadratic in a, we could integrate out a without affecting 
the functional integral for ip a . However, it is also possible to expand the 
square, which leads to the action: 

S = J d d x (^V<p a V<p a + ^rtp a <p a - ^a 2 + ^a<p a <p a ^J (10.108) 

Notice that integrating out a restores the ((p a (p a ) 2 term. 

This is now a quadratic action for (p a . Hence, we can integrate out (p a : 

S«W = j -^yi ("|^ 2 + ™n(V 2 + r + a)) (10.109) 

Dropping a constant, the logarithm can be expanded to give: 
d d p N 



(10.110) 



Since there is a factor of TV in front of S e g [a] , each a propagator carries 
a 1/N, while each vertex carries an N. Hence, a diagram goes as pj v ~ I ~ E = 
jy-E+i-L _ rj-^g i owes ^ order in 1/N for the -E-point a correlation function 
is a tree- level in S e s [a] diagram. To compute the (p a fc-point correlation 
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function, we need to compute a diagram with k/2 external a legs. Hence, 
the lowest order in 1/N contribution to the ip a two-point correlation function 
is obtained from the a one-point correlation function (which is determined 
by the diagram obtained by joining the one-point function to one end of 
the two-point function). It is 0(1) and it is given by the graph of figure 
??a. The lowest order in 1/N contribution to the (p a four-point correlation 
function is obtained from the a two-point correlation function. It is 0(1/N) 
and it is given by the graphs of figure ??b. 
Since the a one-point function is: 



N 



J 



d d q 1 
(2vrp q 2 + r 



while the a two-point function at zero momentum is 

d d q 1 
(2^F {q 2 +rf_ 



2u J 



-, -1 



(10.111) 



(10.112) 



we have: 



r + dr = b r + 



(2tt)< 



q 2 +r 



2u iV Jb 



1 



u + du = b 



d-4 



\2u 



(2n) a 

d d q 



(q 2 +rY 



1 



(2jr) d (,2 + r ) 



(10.113) 



Differentiating these equations, we obtain the RG equations: 
dr _ 1 



dl 
du 



2r 



e u 



16vr 2 
3 



ur + 



u 2 + 



(10.114) 



dl " " 167T 2 

In other words, the one-loop RG equations contain the same information as 



the geometric series of O(l) and 0(1/ N) diagrams! In the iV - 
the one-loop RG equations are valid even when e is not small. 
We can also consider the O(N) generalization of the NLuM: 



/ 



where n is an iV component vector. Imposing the constraint n 2 
Lagrange multiplier, we have: 



oo limit, 



(10.115) 



1 with a 



S = J dd xJ^dr ((Vn) 2 + A (n 2 - l)) (10.116) 
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Integrating out n, we have: 



S = ^- [ d d x [ dr f-^A(n 2 -l) + ]-NTr In (- V 2 + A(f)$l)0.117) 
2# J Jo \2g 2 / 

In the N — > oo limit, the saddle-point approximation becomes exact, so: 

N y 1 1 (lens) 



(2^) d uj 2 +q 2 + \ g 
Let's specialize to the case T = 0: 

at/ a ^^L I = I ( 10 ii9) 

7 27r (2ir) d uj 2 + q 2 + \ g {W.llH) 

This integral equation can be solved using the RG transformation. First, 
we integrate out momenta 6A < \q\ < A, assuming that A << A 2 : 

f b du d d q 1 f 1 ^ d d <7 1 1 , 

N / — ^-^r ~a 5 r + iV / — — -= = = - (10.120) 

7 2tt (27r) d uj 2 + q 2 + \ J b 2vr (2^) d + q 2 + A 5 V ; 

or, 

d 



N I- Mi , \ 2 -fm t~ — f i " ^) = ~ (10-121) 

J 2tt (2n) d oj 2 + q 2 + A (27r) d r (| W - 1 V / ff v ; 



If we bring the second term on the left-hand-side to the right-hand-side, we 
have: 



J 2vr (2vr) d uj 2 + q 2 + A g (2ir) d T (0 d - 1 V / 

(10.122) 

Rescaling the momenta in the integral, q ^ qb, u ^ tub we have: 

J 2tt (2ir) d uj 2 + q 2 + \b 2 \g {2v) d T (f ) d - 1 V V 1 J 

(10.123) 

In other words, 

- + d(-)= (--* "f^ -j— — f 1 - b*-i) ) (10.124) 
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writing b = e , this gives: 

d 

^ = ( d - 1) g - N 2 f (10.125) 

Again, the large- N RG equation is essentially a one-loop RG equation. 

As we will see again in the context of interacting fermions, the large- N 
limit is one in which RG equations can be calculated with minimum fuss. 



10.15 The Kosterlitz-Thouless Transition 

We turn now to the RG analysis of an XY magnet or, equivalently, 4 He at 
zero-temperature in ID 

s =\ps j ' drdxidpOf (10.126) 
or at finite temperature in 2D, 

We will use the notation 

S =^K J d 2 x(d fl 9) 2 (10.128) 

to encompass both cases. This is an 0(2) non-linear sigma model with 
K = 1/g. Clearly, dK/di = to all orders in K. 

The two-point correlation function of the order parameter may be cal- 
culated using: 



e i0(x) e -tf(0) 



J <£>0 e f d 2 x(± K(d»8) 2 +j(x)e(x)) (10.129) 



where 

J(y) = iS(y - x) - iS(y) (10.130) 

Hence it is given by, 

9{x) -i0(O) \ _ \$d 2 xd 2 x' J(x)G(x-x')J(x') 



_ p G(x)-G(0) 



e uw-u W (10.131) 
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Now, 



G(x) - G(0) = (6(x)9(0)) - (6(0)6(0)) 

d q (Jq-x _ i\ J_ J_ 
(2tt) 2 { > K q 2 

1 f A d 2 q 1 



K J± (2tt) 2 q 2 

= —\n\x/a\ (10.132) 

where a = 1/A is a short-distance cutoff. Hence, 

e i0(x) e -i0(o)\ = _L_ (1(U33) 

' |x|27Tif 

Similarly if J2i n i = 0> 

j«E,«.fi(^i)^ _ e - 2FKEi,j««% ln l^-^l (10.134) 

In other words, the correlation function has the form of the Boltzmann 
weight for a Coulomb gas. 

Thus far, we have neglected the periodicity of 6, i.e. the fact that < 
6 < 2ir. However, for \x\ large, 

e -i<(«Cr)-«(0)) 2 > = _L_ (10135) 

tells us that (9(x) — 6(0)) 2 becomes large for \x\ large. This means that 6 
must wind around 2ir, i.e. that there are vortices. 

A vortex is a singular configuration of the field 9(x) such that the vector 
field d ll 6(x) twists around an integer number, n, times as the vortex is 
encircled. In the context of 4 He, a vortex is a swirl of current. In an XY 
magnet, it is a point about which the spins rotate. In other words, 

d il 6dx ii = c l-Kn (10.136) 

p 

for any path, P, which encloses the vortex, n is the winding number of the 
vortex. 

We can understand this qualitatively by calculating the contribution of 
a vortex configuration to the functional integral. If there is a vortex at the 
origin with winding number n, then (10.136) implies that 

27T77 

d^6 (10.137) 
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So the action of a vortex at the origin is: 

= vrifn 2 In - + E c (10.138) 



a 

where R is the size of the system, a is the size of the core of the vortex and 
E c is the core energy. Meanwhile a vortex-anti-vortex pair separated by a 
distance r has energy 

A pair = ttK In ^ + 2£ c (10.139) 

To calculate the contribution of a vortex to the functional integral, we 
must take into account the fact that the vortex can be placed anywhere in 
the system. Hence, the contribution to the functional integral is proportional 
to the area of the system: 



Z„ ~ I — ) e 



a 



3 (2-7rK)lnf 



(10.140) 



For K < 2/-7T, this is a large contribution, so vortices can proliferate. The 
proliferation of vortices destroys the power-law correlation functions. 

Let us now study this transition more systematically. We break 9 into a 
smooth piece, 9 S , and a piece that contains the vortices 9y, 



9 v (x) = arctan ( ^ X *] 2 ) (10.141) 

t \( x - x i)iJ 



where the i th vortex has winding number n\ and position Xj. Using 

me^- -f 

i {x - Xi) 

= y^nje^d/j, In \x- Xj\ (10.142) 

i 

we can rewrite the action as (the cross term between 9 S and 6y vanishes 
upon integration by parts) 



s = \k j d 2 x (d^ey 
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= \k j d 2 x [dp (e s + e v )f 

= x -k J d 2 x (d^e s ) 2 + J d 2 x (d^e v f 

= — K J d 2 x {d^Os) 2 + - K j d 2 x~^\iinjd^\n\x — Xi\d^\n\x — Xj\ 

hi 

= -K I d 2 x {d^Osf - 2itK Yj^nj In \x { -Xj\ + n v E c (10.143) 
J hi 

In the last line, we have restored the core energies of the n v vortices. 
Hence, the partition function is: 



z = Jve s e-^ K f d2x(d ^ 2 x 

h nfr +2 J ii iV 1 !iV_ 1 !iV 2 !Ar_ 2 !... d x * e 

n v =0 i=l nj=0,±l,±2,... i 

(10.144) 

where is the number of vortices of strength k in a given configuration. 
Observe that (10.134) implies that this can be rewritten as: 

Z[<j>] = j VOsVfc-S**^^^^™™*) 6-3^/^(^145) 

where y m = e~ nvEc is the vortex fugacity. The perturbative expansion of 
Z[4>] function is the sum over all vortex configurations of Z[9\. Expanding 
perturbatively in the y^s and using (10.134), we have: 

n v =0 i=l m=0,±l,±2,... i L ' 1 ' z ' z 

(10.146) 

Integrating out 9 S , we are left with: 

Z = J V4>e~ $ d2x (l^K^ 2+ ^™y™ cosm ^) (10.147) 

Notice that we have transformed the partition function for the vortices 
in the field 6 into the partition function for another scalar field, (j>. This is 
an example of a duality transformation. The action for is a sine- Gordon 
model. Let us consider the cos m<fi term in the action: 

S= j d 2 x (J^(d^) 2 + Y,y m cosmA (10.148) 
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Is this term relevant or irrelavant? At y m = 0, we can determine the first 
term in the RG equation for y m from its scaling dimension. This can be 
determined from the correlation function: 

(cos 9{x) cos 0(O))~^L_ (10.149) 

which tells us that cos rrufr has dimension irmK. Hence, the RG equation 
for y is: 

^ = (2-irmK)y m + ... (10.150) 

Consequently, y = y\ is the most relevant operator. As K is decreased, y\ 
becomes relevant first - i.e. at K = 2/tt. Let us, therefore, focus on the 
action with only the m = 1 term: 

S = Jd 2 x (d^f + y cos <frj (10.151) 

In order to study the flow of K resulting from the presence of y, let us 
expand the functional integral perturbatively in y. 

Z = J V4>e~ / d Ml^^<« 2+s/cos ^) 

= . . . + J V<f> J d 2 x cos 4>(x) J d 2 y cos <f>(y)\ e 1 * x {& ^ ? ) + 

+ ! VmV- I d 2 xcos(f){x) I d 2 y cos^(y) ) + 

J \ 2 J\x-y\>l/bA J J 

d 2 x [ d 2 y-e^e-^)]e- Jd2x ^ (d ^ )2 ) 



+ ... 



l/A<\x-y\<l/bA 

1 

l/A<\x-y\<l/bA^ "J " ^ 2 (x - y) 2 * K ' 
... , I**{j£k(W?+V<*»*) + 

"l 



. + J V<j> (^y 2 ^ d 2 x I d 2 y i P ^)-i4>(y) 



\2ttK 



■ + / V(t>(y 2 [ d 2 x I d 2 y 

J Jl/A<\x-y\<l/bA J 

x (l + i{x - y)d<p{y) - \{x - y) 2 (d4>(y)) 2 ) ) e~ (sb <^> a +»»' 



... + 
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= ... + J V4>e- fd2x y2 lnfe ) CW+'H (10.152) 

In the last line, we have done the integral at irK = 2 (since we are interested 
in the vicinity of the transition) where it is logarithmic, and re-exponentiated 
the result. Hence, K^ 1 flows as a result of y: 

± K - 1 = ^y 2 + 0(y A ) (10.153) 
Together with the flow equation for y, 

^ = (2-7rK)y + 0(y 3 ) (10.154) 

these RG equations determine the physics of an XY model in 2 dimensions at 
finite temperature or in 1 dimension at zero temperature. These equations 
may be analyzed by defining u = irK — 2 and v = Any, in terms of which 
the RG equations are: 

- = -v 2 + 0(uv 2 ) 
du 

— = - uv + 0(v 3 ) (10.155) 
Observe that u 2 — v 2 is an RG invariant to this order: 

-^( w 2 -t, 2 )=0 (10.156) 

Hence, the RG trajectories in the vicinity of K = K c = 2/ir are hyperbolae 
which asymptote the lines u = ±v. The resulting Kosterlitz-Thouless flow 
diagram is shown in figure 10.10. 

These RG flows feature a line of fixed points - or a fixed line - y* = 0, 
K > K c . Any point below the the asymptote u = v — or, equivalently, 
7rK — 2 = Any - flows into one of these fixed points. Correlation functions 
exhibit power-law falloff at these fixed points: 

e iO(x) e -i0(p)\ = _L_ ( i .i 57) 

The line irK — 2 = Any which separates these power-law phases from the 
exponentially decaying phase is called the Kosterlitz-Thouless separatrix. At 
the critical point, 

' e u{x) e -iB(p)\_L (iai58) 

' \x\ 4 




Figure 10.10: The Kosterlitz-Thouless flow diagram. 

When the system is above the line irK — 2 = Any, it flows away to large y: 
the system is disordered by the proliferation of vortices and has exponentially 
decaying correlation functions. Since the cos0 term is relevant, it bounds 
the fluctuations of (j), just as an r</> 2 term would. In the problem set, you will 
show that as K c is approached from below, the correlation length diverges 
as: 

f ~ e (Kc-K)V2 (10.159) 

Hence, at finite temperature in 2D or at zero-temperature in ID, 4 He 
and XY magnets have a phase transition between a disordered phase and a 
power-law ordered 'phase'. 

10.16 Inverse square models in one dimension** 

10.17 Numerical renormalization group** 



10.18 Hamiltonian methods** 
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CHAPTER 11 



Fermions 



11.1 Canonical Anticommutation Relations 

In the remainder of this course, we will be applying the field-theoretic tech- 
niques which we have developed to systems of interacting electrons. In order 
to do this, we will have to make a detour into formalism so that we can han- 
dle systems of fermions. 

Let us first consider a system of non-interacting spinless fermions at 
chemical potential /x. As in the case of 4 He, we must modify the Hamiltonian 
by H —>■ H — jj,N. The action is the same as for a system of free bosons: 



The difference is that we want the associated Fock space to be fermionic, i.e. 
we would like the Pauli exclusion principle to hold. This can be accomplished 
by imposing the canonical anticommutation relations. 




(11.1) 




(11.2) 



{iP(x,t),ip(x',t)} 



(x,t),^ (f,i)} = 



(11.3) 



Performing a mode expansion 
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^ {X) = /(^4 e-&T "** (1L4) 

where = — \i = k 2 /2m — fx, we see that the creation and annihilation 
operators satisfy: 

{^^} = {4>i} = ° ( 1L5 ) 

Hence, (c|,) 2 = c| = 0, i.e. no state can be doubly occupied. 
The Green function is: 

Q(x,t) = 6{T)Tr {e-^-^VforMO^)} 

-0(-t) Tr { e -^'-^V(0, 0)^ (s, r)} (11.6) 

Note the —sign in the definition of the Green function. It is necessary 
because the fermions satisfy canonical anticommutation relations. You may 
verify that Q as defined above satisfies: 

Jr" + " M ) S{2 ' T) = 5{T) m (1L7) 

As in the bosonic case, we find a further condition which follows from 
the cyclic property of the trace. Since < r, r' < (3, it follows that — (3 < 
t — t 1 < [3. Now suppose that r < r'. Then, 

Q{r - T ' < 0) = -Tr { e -/^-^ e^"^^-^-^ e^-^t^-^-^) j 

m~vN) p li{H~^N) p T(H-vN) lh } ( ^-T{H-vN)-f3(H-^N) 



-Tr 



= _ Tr { e -W-^ e t>{H- lt N) e T(H- li N)^( x)e . v „ _, e 
e -'(H-MiV)^ (f / )e -r'(H- M iV) } 

= -g(r-r' + /3) (11.8) 

The first equality follows from the cyclic property of the trace. The final 
equality follows from the fact that r — r' + (3 > 0. Hence, a fermion Green 
function is anti-periodic in imaginary time. 

As a result of antiperiodicity in imaginary-time, we can take the Fourier 
transform over the interval [0,/?]: 

f/3 



G(ie n )= [ dre'^Qir) (11.9) 
Jo 
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where the Matsubara frequencies e n , are given by: 

£ „=<^ (U.10) 
Inverting the Fourier transform, we have: 

Q^) = -Sy{ie n )e^ (11.11) 

Using the mode expansion and the Fermi-Dirac distribution, 

Tr {e-KBo-^c} = n F (Ck) = (11.12) 

we can compute the propagator: 

Q(x, T ) = 0(T)Tr{e-0 (H °-^^ t (x,TMO,O)} 

-9{-t) Tr {e-^°-^V(0, 0)^(x, r)} 

___ e ~ik-^ k r {9{t) np{(k) _ 9{ _ T) (1 _ nj? (^.l 3 ) 

We can now compute the Fourier representation of the Green function: 

g(p,i€n) = J d 3 xe^ £ J P dTe- ienT g(x,T) 
n F (tk) - 1) 



(11.14) 



11.2 Grassmann Integrals 

Fermionic systems can also be described by functional integrals. In order 
to do this, we will need the concept of a Grassmann number. Grassmann 
numbers are objects ipi which can be multiplied together and anticommute 
under multiplication: 

ipiipj = -ipiipj (11.15) 



and 

1.2 



Vi = (11-16) 
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Grassmann numbers can be multiplied by complex numbers; multiplication 
by a complex number is distributive: 

a{ipi +^2) = aip! + bip 2 (11-17) 

ip = ipi + itp2 and ip = ip\ — iip2 can be treated as independent Grassmann 
variables, 

^=_^ (11.18) 

Since the square of a Grassmann number vanishes, the Taylor expansion of 
a function of Grassmann variables has only two terms. For instance, 

e^ = l + ip (11.19) 

Integration is defined for Grassmann numbers as follows: 

dip = 

dipip = l (11.20) 
Similarly, 

f dip dip = 
dip dip ip = 
dip dip ip = 
dipd^ip = l (11.21) 

As a result of the anticommutation, the order is important in the last line: 

'<il>$ = -l (11.22) 



y ^ ^ ' 

Since the square of a Grassmann number vanishes, these rules are sufficient 
to define integration. 

With these definitions, we can do Grassmann integrals of Gaussians. 
Suppose 0{ and 6{ are independent Grassmann variables. Then 

J d6 1 d6 1 ... d9 n d9 n e^ diAi ^ = J d6 1 d9 1 ... d6 n d6 n Y[e^ A ^ 
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d6 1 d6 1 . . . d6 n d6 n Yl (1 + OiAijOj) 

i,j 

= / dOiS\ . . . d6 n S n ( J 4la(l)^2cr(2) • • • 4 ff (n)) 
^ a 

x ^l^cr(l) • • • @nO<j(n) 

= (^1^(1)^2 CT (2) • • • A nu(n)) 

= det (A) (11.23) 
We can prove Wick's theorem for Grassmann integrals: 

Z(r H ,rj i ) = j \\de i de i e^M^ e ^^+ d ^) (11.24) 

i 

By making the change of variables, 

o i = o i ' + {A- 1 )..r h 



Oi = Bl + rj j (A- 1 ) ji (11.25) 



we get 



i 

= det(A) e^M*- 1 )^ ( 1L 26) 



Hence, 



.[\\;<l(>;<IO:th O r ...",,^< >: ° " 



d d d d r7l 



det(A) \drii! dfj h drj ik drj 



3k / J7 i=r)i =0 



cr 

= Y^(-ir{(e n e Ml) ) ... (e lk e Mk) )} (11.27) 

cr 

In other words, we sum over all possible Wick contractions, multiplying by 
— 1 every time the contraction necessitates a reordering of the fields by an 
odd permutation. 

Thus far, we have considered finite-dimensional Grassmann integrals. 
However, the generalization to functional Grassmann integrals is straight- 
forward. 
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11.3 Solution of the 2D Ising Model by Grassmann 
Integration 

In this section, we will present the solution of the 2D Ising model as an 
application of Grassmann integration. Our strategy will be to represent a 
state of the Ising model in terms of the domain walls which separate up- 
spins from down-spins. We will then concoct a Grassmann integral which 
generates all allowed domain wall configurations. 

We will assume that the Ising model is on a square lattice. The Grass- 
mann variables will live on the sites of the dual lattice. The first thing to 
observe is that there are 8 possible configurations of the four spins surround- 
ing a given site on the dual lattice, up to an overall flip of the four spins. 
They are depicted in figure ??. These 8 configurations will give a graphical 
representation of the possible non-vanishing terms in the Grassmann inte- 
gral. Thus, if we weight each of these terms in the Grassmann integral with 
the appropriate Boltzmann weight, the Grassmann integral will be equal to 
the partition function of the Ising model. 

Consider the following Grassmann integral. 

z = /IK; ^':;''- ;J ; ( n - 28 ) 

J i,j 

where i,j labels a site on the square lattice, and r^'? ', Xif are Grassmann 
variables. The h, v index stands for horizontal and vertical. Aij is given by 

■A-ij = Z XijVi+lj + z Xi,jVi,j+l 

+Xi,j r li,j + Xi,jVi,j + Xi,jXi,j + r li,j r li,j 

+xtd,3 + \ ;:,'/;:, ( n - 29 ) 

where z = e /3J . It is straightforward to allow anisotropic coupling constants 
Jh, J v by replacing the coefficient of the first term by Zh and the second by 
z v . The Grassmann integral (11.29) is quadratic, so it can be performed 
using the definitions in this chapter. We will do this shortly, but first we 
must establish the raison d'etre of this integral, namely that it reproduces 
2D Ising model configurations. 

When we expand the exponential of (11.29) we produce terms which are 
strings of x h ' Vs an d rj h ' v s. The integral of such a term is non-zero if and 
only if every single x h,v an d appears exactly once in the term. We 
can introduce the following graphical representation for the terms which are 
generated by expanding the exponential of (11.29). The first two terms are 
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represented by lines of domain wall connecting sites and (i + or 
(i,j + 1). They come with a coefficient z, which is the Boltzmann weight 
for such a segment of domain wall. 

The next four terms place a corner at site (i, j) . The four types of corners 
connect, respectively, an incoming line from the West to an outgoing line to 
the North (fig ??); an incoming line from the South to an outgoing line to 
the East (fig ??); an incoming line from the West to an incoming line from 
the South (fig ??); and an outgoing line to the North to an outgoing line to 
the East (fig ??). These terms implement these corners in the following way. 
Consider the first such term. When it appears in a string, XijVij cannot 
appear a second time in the string. Therefore, it is impossible to have a 
segment of domain wall originating at and heading to (i + It is 

also impossible for a segment of wall coming from — 1) to arrive at (i, j) . 
It is also impossible for any of the final five terms in Ai j to appear in the 
string because either Xij or Vij would appear twice. However, x\j or Vij 
must appear once in the string. This can only happen if the second term in 
Aij and the first term in Ai-ij appear. Hence, this term places a corner 
at site Note that two such corners cannot occur at the same site. 

However, a horizontal domain wall can cross a vertical domain wall, which 
accounts for the configuration in fig ??; this occurs when the first term in 
Ai-ij and Aij appear and the second term in -Ajj-i and Aij also appear. 

The last two terms simply prevent, respectively, a horizontal or vertical 
domain wall from passing through In a configuration in which there is 

no domain wall passing through both will appear. If there is a vertical 

domain wall passing through then the first of these terms will appear 

(as will the second term of Aij, with its concomitant z). 

Thus, we see that by expanding the exponential of bilinears in Grass- 
mann variables we can reproduce all possible domain wall configurations. 
The only concern is that some of them may not come with the correct sign. 
The reader may check that they do, essentially because every loop of domain 
wall contains an even number of segments and and even number of corners. 

Thus, we can, indeed, compute the partition function of the 2D Ising 
model by evaluating the Grassmann integral above. It is equal to the square 
root of the determinant of the quadratic form in We- can almost 

diagonalize it by going to momentum space: 




(11.30) 



with a similar equation for x- Now the exponent in the Grassmann integral 
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can be rewritten: 

^ ^^iJ 1 ^ \^Z G- ^ — ky ^7kx iky ^ ^ ^ — &X 5 — ky ^ kx iky 



1,3 



^ — kxi — ky^lkx^ky — kx 5 — ky^ikx ,ky 

^ ; ky'X-kx,ky ^1 kx , ky^lkx ,ky 



^ ^ ^X ; ky^lkx^ky ^ kx , fcy ^fc^ ; 

^ _ ^ ^ ^ ' ky^lkx,ky _ ^ ^ ^ ^X ) ky^kxiky 



kx-fky 



^ ^ , ky^lkx ,ky kx, ky^lkx,ky 

^ kx i ky^^-kx }ky ^ ^7 kx i ky^lkx -jky ( ^ ) 

Up to a normalization N, the Grassmann integral is given by the: 
Z = M II (detM(k x ,k y )) 1/2 

kx iky 

= Me hV!^^M {kx , ky) (n32) 

where M(k x ,k y ) is the 4x4 matrix of coefficients in (11.31) and V is the 
area of the system. Computing the determinant of this 4x4 matrix, we 
find: 

1 f d 2 k 

In Z = - V / - — -r In [cosh 2 2/3 J - sinh 2(5 J (cos k x + cos fc w )l (11.33) 

2 y (27r)^ 

The infrared behavior of this integral, where possible singularities lurk, 
can be obtained by studying small k. In this limit, the argument of the 
logarithm is of the form Kp 2 +r, where r = (sinh2/3J — l) 2 . This is precisely 
what we would expect for a system of particles with propagator Kp 2 + r. 
Such a system has correlation length £ ~ Vv^"- There is a critical point, 
and its concomitant singularities and power-law correlation function occur 
when r = 0, i.e. sinh2/? c J = 1. Expanding about this critical point, we have 
£ ~ 1/|T — T c \, i.e. v = 1, which is rather different from the mean-field 
result v = 1/2. Performing the integral above, we obtain the free energy 
density: 

F = F Ilon _ sing . - — (sinh 2/3 J - l) 2 In |sinh 2(3 J - 1| (11.34) 
2tt 

Hence, the specific heat diverges logarithmically, a = 0~ . 
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11.4 Feynman Rules for Interacting Fermions 

Let us now turn to a system of fermions with a 5-function interaction. The 
grand canonical partition function is given by: 

Z = N j VipV^ e~ s (11.35) 

where the functional integral is over all Grassmann-valued functions which 
are antiperiodic in the interval [0,(3] (so that e n = (2n + 0). 

S = j'^dr J d d x (V (dr ~ ~ l^J i> + V^^i^j (11.36) 

This action has the U{1) symmetry tp — ► e te ip, tp^ — > e~ t9 tp^ . According 
to Noether's theorem, there is a conserved density, 

p = iftij) (11.37) 

and current 

j = ^— 1 f,- ( ) ^ (11.38) 
m \m J 

satisfying the conservation law 

^ + V-J=0 (11.39) 
For V = 0, this is the free fermion functional integral: 

/ e -S!*S = det [d T - - M )) (H.40) 

The Green function is: 



Q{x,t) =M j Viplfyt iI>*(x,t)iI>(0,0) e 



*- (£""))" (1141) 

The difference between this Green function and the Green function of a 
bosonic system with the same Hamiltonian is that this is the inverse of this 
operator on the space of functions with antiperiodic boundary conditions on 
[O,0\. The Fourier transform of the Green function is: 

G(k,en) = ^ v (11.42) 

\2m 
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Figure 11.1: The graphical representation of the fermion propagator and 
vertex. 

In the presence of source fields rj, fj, 

Z [rj,fj] = N J TH/,Ttyt e -ti* S * x +^<&-*))^™ 

= e -Jo drdT ' fd d xd d x'fj(S,T)g{x-x',T-T')r,(x',T') (11.43) 

For interacting fermions, it is straightforward to generalize (7.31) to Grass- 
mann integrals, so that 

N - r r (a. aS\ 
%^] = F e JLmt y*>'»>) Zofafj] (11-44) 

In applying this formula, we must remember that rj and fj are Grassmann 
numbers so a — sign results every time they are anticommuted. As in the 
bosonic case, we can use (7.32) to rewrite this as: 

(11.45) 

By expanding the 

e - / Cint{^ ,tl>)+ri^ +4>f) (11.46) 

we derive the following Feynman rules for fermions with <5-function inter- 
actions. The lines of these Feynman diagrams have a direction which we 
denote by an arrow. Each vertex has two lines directed into it and two lines 
directed out of it. Momenta and Matrsubara frequencies are directed in the 
direction of the arrows. The propagator and vertex are shown in figure 11.1 

• To each line, we associate a momentum, p and a Matsubara frequency, 



• The propagator assigned to each internal line is: 
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• For each vertex with momenta, Matsubara frequencies (pi , e m ) , (f>2 , e n2 ) 
directed into the vertex and (ps, e„ 3 ), (p^, e n4 ) directed out of the ver- 
tex, we write 

V (2vr) 3 <5(pi +P2-P3- Pi) (3 5„ 1+ „ 2 ,„ 3 +„ 4 

• Imagine labelling the vertices 1, 2, . . . , n. Vertex i will be connected 
to vertices j±, . . . ,j m (m < 4) and to external momenta pi, ■ ■ ■ ,P4- m 
by directed lines. Consider a permutation of these labels. Such a 
permutation leaves the diagram invariant if, for all vertices i, i is still 
connected to vertices j\ , . . . , j m (m < 4) and to external momenta 
pi,...,P4_ m by lines in the same direction. If S is the number of 
permutations which leave the diagram invariant, we assign a factor 
1/S* to the diagram. 

• If two vertices are connected by / lines in the same direction, we assign 
a factor l/l\ to the diagram. 

• To each closed loop, we assign a factor of —1. 

The final rule follows from the necessity of performing an odd number of 
anticommutations in order to contract fermion fields around a closed loop. 

For /j, < 0, (11.36) describes an insulating state. There is a gap to all 
excited states. For V = 0, the gap is simply — //. In the problem set, you 
will compute the gap for V / perturbatively. 

For fi > 0, the ground state has a Fermi surface. For V = 0, this 
Fermi surface is at kp = \J2m[i. In the problem set, you will compute the 
Fermi momentum for V ^ perturbatively. Since this phase has gapless 
excitations, we must worry whether the interaction term is relevant. If the 
interactions are irrelevant, then we can perturbatively compute corrections 
to free fermion physics. If interactions are relevant, however, the system 
flows away from the free fermion fixed point, and we must look for other 
fixed points. Such an analysis is taken up in the next chapter, where we 
see the importance of the new feature that the low-energy excitations are 
not at k = 0, but, rather, at k = /cp. We construct the renormalization 
group which is appropriate to such a situation, thereby arriving at Fermi 
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Figure 11.2: A one-loop diagram with an intermediate (a) particle-hole pair 
and (b) particle-particle pair. 
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liquid theory. First, however, we will investigate the two-point function of 
the interacting Fermi gas perturbatively. 

Fermion lines with arrows that point to the right represent fermions 
above the Fermi surface. Those which point to the left represent holes 
below the Fermi surface. This is analogous to electrons and positrons in 
QED. However, unlike in QED, where a positron can have any momentum, 
fermions must have k > kp and holes must have k < kp at T = (at 
finite-temperature, this is smeared out by the Fermi function). Hence, the 
diagram of figure 11.2a corresponds to the expression 



When q < kf, the second Green function represents the propagation of a hole 
at q while the first Green function represents the propagation of a fermion at 
p + q. If p + q isn't above the Fermi surface (smeared by the Fermi function), 
then this expression vanishes, as we will see shortly. Similarly, when q is 
above the Fermi surface, p + q must be a hole below the Fermi surface. 
Meanwhile, the diagram of figure 11.2b corresponds to the expression 



where q and p — q are now both fermions above the Fermi surface. 

11.5 Fermion Spectral Function 

Following our earlier derivation of the phonon spectral representation, we 
construct a spectral representation for the fermion two-point Green function. 
By inserting a complete set of intermediate states, \m)(m\, we have, 



<?(*, r) = / d 3 pde?£S(p- Pm + Pn)S(e - e„ m )(0(r)e-^ +CT 




(11.47) 




(11.48) 



n,m 




The Fourier transform, 




(11.50) 
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is given by: 



771 



^(0,0) 
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x 5(p - p m + p n )5(E - E m + E n )] 



E — ie-i 



:n.5i) 



Writing 



£) = J] (e"^ + e"^) |(m |^+(0, 0)| n)| 2 <5(p-p m +p n ) 5(E—E mn ) 

n,m 

(11.52) 



we have the spectral representation of Q: 



E- tej 



(11.53) 



As usual, the spectral function A(p, E) is real and positive. It also satisfies 
the sum rule: 



Of****- 1 



(11.54) 



Q is not analytic since it does not satisfy the Kramers-Kronig relations. 
However, the advanced and retarded correlation functions, 



Gad v (p,e) 



°° dE Mp.e) 



-00 

oo 



dE 



E-e-iS 
A(p,E) 

E-e + iS 



(11.55) 



are analytic functions of e in the upper- and lower-half-planes, respectively. 

As usual, the spectral function is the difference between the retarded 
and advanced correlation functions. 



G re t(p, e) - G a dv(p, e) = 2mA(p, e) 
The spectral function of a free Fermi gas is a 5-function: 



(11.56) 



(11.57) 



In an interacting Fermi gas, the spectral weight is not concentrated in a 8 
function but spread out over a range of frequencies as in figure 11.3. 
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Figure 11.3: The spectral function in free and interacting Fermi systems. 

11.6 Frequency Sums and Integrals for Fermions 

To compute fermion Green functions perturbatively, we will need to do sum- 
mations over Matsubara frequencies. Sums over fermion Matsubara frequen- 
cies can be done using contour integrals, as in the bosonic case. Consider 
the Matsubara sum: 

iVV?(i$7 m + ie n ,p + q)G(ie n ,q) = I ^-n F (e)Q{j£l m + e,p + q) G{e, q) 
P „ Jc 2 ™ 

(11.58) 

where the contour avoids the singularties of the Green functions, as in chap- 
ter 6. e n and £l m + e n are fermionic Matsubara frequencies, so Q m is a 
bosonic one. The contour integration is given by two contributions: e real 
and iQ-rn + e real. Hence, 

i i r°° 

J2Q(i^m + ien,P + q)Q(ie n ,q) = 7T~ / dEn F (E) Q{E + itt m ) {Q{E + iS) - Q(E - iS)) 

P n * m J—oo 

1 f°° 

+— / dE n F (E - ifi ro ) (Q{E + iS) - Q(E -iS))G(E- zfi m ) 

Analytically continuing the imaginary-time Green functions, we have: 
1 f°° 

-Y^G{i^ m +ie n ,P + q)G{ie n ,q) = / dE n F (E) Q(E + i£l m ,p + q) A(E, q) 

P „ J—oo 
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/oo 
dEn F {E) Q(E - in m ,q)A(E,p + q) 

(11.59) 

In the case of free fermions, the spectral function is a <5-function, so the 
dE integrals can be done: 

^( in m + ie n ,P + q)g(ie n ,q) = ^M^E^±sl (1L60) 

At zero-temperature, the discrete frequency sum becomes a frequency 
integral, 

so 

1 , f°° de 

-^2g(iU m +ie n ,p + q)g(ie n ,q) -> J —g(i9, + ie,p + q)Q(ie,q) (11.62) 
Using the spectral representation of Q we can rewrite this as: 

+ a «*,«) - r f r iEl r ^ ^±|^ f$ 

)2vr J_ 00 2irJ_ 00 y.^ E 1 -iQ-i€ E 2 -te 



The <ie integral can be done by closing the contour in the upper-half-plane. 
The pole at e = — iE\ — O is enclosed by the contour when E\ < 0; the pole 
at e = — 1E2 is enclosed when E2 < 0. Hence, 

f°° + ie, v + 0) 9(ie, <f) = f°° dE, f°° dE 2 ^'f^ ~ 9( ~ El) A(p + q, E ± ) A^M) 

J-oo 27T J -00 J -00 E2 — E-1+IU 

In the case of free fermions, the dEi integrals may be done: 

^ ^G(in + ie,p + q) Q(ie, q) = d ±<sLzlt<E±il ( n. 65 ) 

J-oo 27T «i + ?g-4p+<? 

which is the zero-temperature limit of (11.60). 

11.7 Fermion Self-Energy 

We can begin to understand the role played by the Fermi surface when we 
start computing perturbative corrections to the behavior of free fermions. 
Let us look first at the fermion two-point Green function. As in the bosonic 
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case, we can define the self-energy, S(e, k), as the 1PI two-point function 
and sum the geometric series to obtain: 



G(p,ie n ) = 



(11.66) 



le 



The retarded Green function is defined by analytic continuation: 

G iet (p,e) = j— ^ 

e ~ (hi -A*J -£ret(e,p) 

The spectral function can be written as: 

1 -2ImS rct (e,_p) 



(11.67) 



A(p, e) = 



(11.68) 



n ( e ~ (f^ ~ A*) ~ ReS rct (e,p)) + (ImS re t(e,p))' 
When ImS ret (e,p) = 0, the spectral function can be rewritten as: 

A(p,e) = Z(p)6(e-£ p ) (11.69) 
where £ p is the location of the pole, defined by the implicit equation 



p 

^ p = 2m ~ M ~ ReS rct(£p,P) 



(11.70) 



and Z(p) is its residue 



Z{p) 



I - ( ^ReS rct (e,p) 



£ p can be expanded about the Fermi surface: 

i P = v*f(p~ Pf) +0({p- p F ) 2 ^ 

where 



PF 
Vp = — - 

m* 



(11.71) 



(11.72) 



(11.73) 



and m* is the effective mass. £ p and define the one-particle density of 
states, N(cf), of the interacting problem. 



d d k 



r d d i 

J (&F 



2tt2 



(2vr)^r (I) 



J k^dk 
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d 



(27r)^r(|) ^ y ^ fe 



= AT(e F ) J d^ k (11.74) 

The lowest-order contribution to Im£ rct (e,p) comes from the diagram of 
figure ??. We can do the zero-temperature calculation by contour integra- 
tion: 

(H-75) 

The d( integral may be done by contour integration, as in (11.65): 

d d q r d d P [duj d(-Q - e{-^ p+q ) 



/d q f d p f duj 



(11.76) 

The doo integral may be done the same way: 

d d q f d d p (9Kp)-9(-W)(^H)-%-W) 



ICO + Cp — ip+q — ik-q 

(11.77) 

Hence, the imaginary part of the self-energy at zero-temperature is: 
ImS ret (e, k) = y 2 kp y2 J d Zpf o d Zk- q (2^p d (ll)d 6 & ~ Zp+q ~ + e) 

d ~ 1 ) f0 re 

<v 2 ^^sj_ 1 / dt P 

V F J-e J 

= V 2k -^S\_^ (11.78) 

V F 

Hence, we have seen that the phase space restrictions (imposed by the 
5 function above) due to the existence of a Fermi surface severely restricts 
ImS rct (e, k). For e — > 0, ImS re t(e, k) ~ e 2 . In other words, for e small, 
the decay rate is much smaller than the energy: near the Fermi surface, 
single-fermion states are long-lived. 
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11.8 Luttinger's Theorem 

Up until now, the Fermi surface has essentially been a tree-level, or free 
fermion, concept. However, the notion of a Fermi surface is not tied to 
perturbation theory. In fact, the existence and location of a Fermi surface 
is constrained by a non-perturbative theorem due to Luttinger, which we 
now discuss. Luttinger's theorem defines the Fermi surface as the surface 
in fe-space at which G(0, k) changes sign. Inside the Fermi surface, G(0, k) 
is positive; outside the Fermi surface, G(0, k) is negative. In a free fermion 
system, G(0, k) diverges at the Fermi surface. (In a superconductor, G(0, k) 
vanishes at the Fermi surface, as we will see later.) According to Luttinger's 
theorem, the volume enclosed by the Fermi surface is equal to the electron 
density, N/V, so long as ImS(0, k) = 0. 
To prove this, we begin with 



In the second line, we have the time-ordered Green function; the advanced 
and retarded Green functions vanish at equal times. If we write 




(11.79) 




1 



(11.80) 



then 




(SP^[*" ,G( ' ; ' £) - G( ' ; ' e) * E(£ ' i:) 




(11.81) 



We will now use the following 'lemma' which we will prove later: 




(11.82) 
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Then 



TV 

v =l 



d d k de d 

— — lnG(fc,e) 



(27r) d 2tt de 

/d d k de d f 



Since G re t is analytic in the upper-half-plane, the first integral vanishes. 
Also note that G = G TCt for e > 0, while G* = G TCt for e < 0. Hence, 



N 

v= l 



d d k f° de d , G(ife, e) 
— — ln ■ 



/ 



(27r) d 



ln 



,27r <9e G ret (A;, e) 
G(fc,e) 1° 



(27T) d L G ret (Mj-oo 

d d k 

J^y [V(0,fc) - <p(-00,fc)] 



From the spectral representation, 



G(£e) 



/: 



A(p,£7) 



dE 1 ■ 

e - £ + i<5 sgn(e) 



(11.84) 



(11.85) 



and the normalization property of the spectral function, we see that ip(—oo, k) = 
7r. Hence, 

H0,k)-7r] (11.86) 



N 
V 



(2n) 6 



Since ImS(0, k) = by assumption, ip{0, k) is equal to or ir. The integral 
only receives contributions from the former case: 



N 
V 



d d k 



(11.87) 



where 1Z = {k\G(0, k) > 0}. In other words, the volume enclosed by the 
Fermi surface is equal to the electron density. 

To complete the proof of the theorem, we must prove that 

d d k de r) 

(^2^ e) * E<t ' fc) =° (1L88) 
To do this, we prove that there exists a functional X[G] defined by: 



SX = 



duj 



d 3 k 



2vr J (2tt) 



(11.89) 
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According to this definition, 

d 3 k 



f du r d 3 k 
J 2^ J W) 



^T,(u; + e,k)5G(uj + e,k) (11.90) 



Hence, 



/did f d k d 

ltSw?^ k) irJ G ^ k) (im) 



However, X is independent of e, so 5X/Se = which proves (11.88). 
To see that X actually exists, observe that 

' )A S(p) (11.92) 



SG{p) 
Hence, 

5 2 X _ 6X(p) 
8G(p)5G(q) ~ 5G(qj 

X exists if and only if the derivatives can be commuted: 

6 2 X 6 2 X 
5G(p)5G(q) ~ 5G(q)5G( P ) 

Since 

<JE(p) 



(11.93) 



(11.94) 



SG(q) 



T(p,q) (11.95) 



where T(p,q) is the irreducible 4-point function with external momenta 
p,p, q, q and T(p, q) = F(q,p), the existence of X follows. 

In the case of a free Fermi gas, = (c k Ck) is a step function, = 
9{k,F — k) with J rifc = N/V. One might imagine that, in an interacting 
Fermi gas, it would be possible to have n& = X6(kF — k\) with A < 1 which 
would preserve J n k = N/V while moving the location of the singularity 
in rifc to kp/X. Luttinger's theorem tells us that this cannot happen. The 
singularity in = f deG(e, k) is fixed by the density to be at &f. 
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CHAPTER 12 



Interacting Neutral Fermions: Fermi Liquid Theory 



12.1 Scaling to the Fermi Surface 

We now consider a rotationally invariant system of interacting spinless fermions 
with fx > in D > 2. The RG analysis of such systems was pioneered by 
Shankar, Polchinski, .... The Fermi sea is filled up to some hp. First, let 
us examine the free part of the action, 




(12.1) 



(12.2) 



If the kinetic energy is much larger than the potential energy, then it makes 
sense to focus first on it and use it to determine the scaling of ip from the 
kinetic energy. 

Since we will be interested in low-energy, i.e. the vicinity of the Fermi 
surface, we make the approximation 

k 2 

- ii k, v F (k - k F ) 

2m 

= v F l (12.3) 
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where I = k — Hf- We can also make the approximation 



F 

Hence, the action can be written as: 



JL_ j dld d Sl—^{ie-v F l)il> (12.5) 



d d k = k d f l dkd d Vt 

= k d ~ 1 dld d n (12.4) 



(2vr) d 

Restoring the cutoffs, 



h.d-1 f-A roc J 

i / dld d n / —^{ie-v F l)i> (12.6) 

J -A J-oo 27r 



(27T) 



The momentum integral is restricted to a shell of thickness 2A about the 
Fermi surface. We leave the frequency integral unrestricted (as we did in 
the case of the 0(3) NLuM). The angular integral has no cutoff, of course. 
Our RG transformation now takes the following form: 

• Integrate out ^(l,Q,€),ip^(l,Q,e) for bA < \l\ < A and e, D, arbitrary. 

• Rescale: 

to — > buj 
I bl 

ij) b~%ip (12.7) 

The principal difference between the renormalization group applied to a 
system with a Fermi surface and its application to more familiar contexts is 
that the low-energy degrees of freedom are located in the neighborhood of a 
surface in momentum space, rather than in the vicinity of a point. Hence, 
we do not scale to the origin of momentum space, k = 0, but to the Fermi 
surface, / = 0. 

The free fermion action (12.5) is evidently a fixed point of this RG trans- 
formation. Thus, as we would expect, a free fermion system looks the same 
at any energy scale: it is always just a free theory. 

To this action, we can add the following perturbation: 

t.d-1 pA poo j 

jwIj'^L^* (i2 - 8) 
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Under the scaling (12.7), S/j, scales as: 



S/j, — > b 5jjL 



(12.9) 



Since 5/x is a relevant operator, we cannot study it perturbatively. Relevant 
operators typically bring about a fundamental change of the ground state, 
and dfi is no different. Changing the chemical potential shifts the Fermi 
surface. If we change coordinates to P = I + then we recover (12.5). 

In a system which is not rotationally invariant, 5fi can depend on the 
angle around the Fermi surface, 8/j, . S/j, (jlj is an example of a 'cou- 
pling function', which is a generalization of a coupling constant. Such a 
perturbation can change the shape of the Fermi surface. 

A second perturbation is 



Svf shifts the Fermi velocity. It is clearly a marginal perturbation. 

12.2 Marginal Perturbations: Landau Parameters 

Let us now consider four-fermion interactions. Consider the term 



Rather than a single coupling constant, u, we have a coupling function, 
u(k\, &2> k%, k^). The RG equations for a coupling function are called func- 
tional RG equations. We will assume that u(k±, k2,k%, k^) is non-singular or, 
in other words, that the fermions have short-ranged interactions, as in 3 He. 
We will deal with the complications resulting from Coulomb interactions in 
the next section. 

If u{ki, k2, &2, k\) is a function only of the angular variables, u(Cli, J^, ^3, ^4), 
then it is marginal, i.e. it does not scale under (12.7). If it depends on the 
\h\ — feel's, then it is irrelevant, so we ignore this possibility. Similarly, 
six-fermion, eight-fermion, etc. interactions are neglected because they are 
highly irrelevant. Furthermore, momentum conservation implies that 




(12.10) 



S 4 = 



du>\duj2du>z d d k\d d k2d d kz u{k\, %2, £3, k^)x 

ip^{k A , (fc 3 , i0 3 )ip(k 2 , L0 2 )Tp(ki, LOi) (12.11) 



u(ki,k 2 , k 3 , fe 4 ) = u(ki,k 2 , h) 8 [ki + k 2 - k 3 - kA (12.12) 
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Figure 12.1: (a) If all of the momenta are constrained to lie on the Fermi 
surface, incoming momenta ki, k<i can only scatter into k-$ = k\,k± = ki or 
&3 = &2, fe4 = fei, unless (b) k\ = — hi- 
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In the last section, we considered the case of 6 function interactions, for 
which u(ki,k2,kz) = V. Here, we are considering the more general case of 
arbitrary (non-singular) u(ki, &2, k 3 ). 

The crucial observation underlying Fermi liquid theory, which is depicted 
in Figure 12.1, is the following. Consider, for simplicity, the case of D = 2. 
For A <C kp, u(ki,k2,ks) = for generic ki,k%,k3 because £4 typically 
does not lie within the cutoff. The constraint of momentum conservation, 
ki + k2 = k 3 + A?4 together with the restriction that k\ , k,2 , £3 , £4 lie within 
A of the Fermi surface severely limits the phase space for scattering. As 
we scale to the A — > limit, only forward scattering, u(k±, k2, k±, A^) and 
exchange scattering, u(k±,k2,k2,ki) = — u(k±, &2, fei, ^2), can satisfy mom- 
ntum conservation. At small but non-zero A, a small subset of the u's are 
non-zero. As A is decreased, some of these are set discontinuously to zero; 
the rest do not scale. As A becomes smaller, fewer non-zero u's remain until, 
finally, at A = 0, only the three mentioned above remain. It is this drastic 
simplification which makes Fermi liquid theory soluble. 

In three dimensions, the angle between fc 3 and £4 is the same as the angle 
between k\ and &2 

0(£i,&) = ±0(fc 3 ,fcO (12.13) 

but the plane of fc 3 and k^ can be rotated relative to the plane of k\ and k2 
by and angle <j) as in figure 12.2. 

These phase space restrictions imply that in two dimensions, we should 
focus on 

F(e 1 -9 2 ) = 62,^2,^1) (12.14) 

Fermi statistics dictates that exchange scattering is related to forward scat- 
tering by: 

14(^1,^2,^2,^1) = —u(Qi, ^2,^1,^2) 

= -F(9 l -9 2 ) (12.15) 

In three dimensions, we should focus on: 

f(^i • n 2 , <l>) = u{n u n 2 , n 3 , f^) (12.16) 

in the A — > limit. The Fourier components of F are called Landau Param- 
eters. 

There is one loophole in the preceeding analysis, depicted in figure 12.1b. 
If k\ = —k2, then /c 3 = — k± is arbitrary. This is the Cooper pairing channel. 
We write: 



V(tti • 3 ) = u(ki,-ki, fe 3 , -k 3 ) 



(12.17) 




Figure 12.2: In three dimensions, the outgoing momenta can be rotated 
relative to the incoming momenta. 



In D = 2, this can be written as: 

V{0i - e 3 ) (12.18) 
Then, at tree-level, in the A — ► limit, we have the following action: 

S =r E - 1 / dl d d n (ie - v F l) V 
(27r) d J 2vr y ' 

+ J deide 2 de 3 dkidk 2 dk 3 d d Qid d Q. 2 F(^i ■ Q 2 , (f>) x 
V' t (/c 4 , £4)^(^3, £3)^(^2, e 2 )ip{ki, ei) 

+ J deide 2 de 3 dfcidfc 2 dfc 3 d^d^g V(^i • fi 3 )x 

^(^,€4)^(^3, €3)^(^2, e 2 )V>(fci,ei) (12.19) 

For A finite, we have to keep the full coupling function u(S7i, f^, ^3, ^4) 
with 

ni-n 3 



< 



or 



< 



A_ 



(12.20) 
(12.21) 
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Figure 12.3: The phase space available to k% and k± when A is small but 
non-zero is the region bounded by the arcs which has area ~ A 2 . 



as in figure 12.3 



12.3 One-Loop 

At tree-level, F(Qi — Q2) and V(Qi — Q3) are marginal. We would now like 
to compute the one-loop RG equations for F(&i — U 2 ) and V(fii — Q3). 

First, consider the renormalization of F. The one- loop diagrams are in 
figure 12.4. Since F is independent of the frequencies and the Zj, we can set 
the external frequencies to zero and put the external momenta on the Fermi 
surface. The first diagram gives a contribution 

dF(ti 1 -n 2 )= I dldndeFi^Fin + n!-^) — 7 — 7 (12.22) 

J xe — VFiit — vfI 

which vanishes since both poles in e are on the same side of the axis. 

The internal momenta in these diagrams must lie in thin shells at the 
cutoff, A — dA < \p\ — h,F < A. In the second diagram, p and p + k\ — k 2 must 
both satisfy this condition. The condition on \p\ restricts its magnitude; the 
condition on \p+k±— k 2 \ restricts the direction of p. The kinematic restriction 
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Figure 12.4: The one-loop diagrams which can contribute to the renormal- 
ization of F. 

is essentially the same as that depicted in figure 12.3. As a result, the dl 
and dVL integrals each give a contribution proportional to dA, and therefore 

dF ~ (dA) 2 (12.23) 

in the dA — > limit, this gives a vanishing contribution to dF/d£. The 
third diagram gives a vanishing contribution for the same reason. Hence, at 
one-loop, 

^-F(n 1 -n 2 ) = o (12.24) 

di 

The Landau parameters are strictly marginal; they remain constant as we 
scale to lower energies. 

We now turn to the one-loop RG equations for V. The relevant diagrams 
are analogous to those of 12.4. The first two diagrams are proportional to 
(dA) 2 and, therefore, do not contribute to the RG equation. However, the 
third diagram gives the contribution 

f df dl d d Q - 1 1 

dv(n ± - n 3 ) = - / f ^ 7 f^ T m - nmn - n 3 ) . . . . 

7 2-7T 27T (27r) a 1 xe — vfI —it — vfI 
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/s^ v(fi '- fi)v(fi - fl » ) 



2vpp 

= ^r F de I wj^ v ^ - ^ - ^ (12 - 25) 

In two dimensions, we write 

/•27T JO 

V m = J ^e im6 V{9) (12.26) 
The renormalization group flow equation for V\ is: 

V? (12.27) 



dVl 1_ F 2 

din A 2ttvf 



™ - 1+ ^S', n(Ao/A) 

Therefore, repulsive BCS interactions are marginally irrelevant, while at- 
tractive BCS interactions are marginally relevant. From (12.28), we see 
that an attractive BCS interaction will grow as we go to lower scales, until 
it reaches the scale: 

A~ A e -2^/|Vi(Ao)| ( 12 .29) 

As we will discuss later, at this scale, pairing takes place. In the BCS theory 
of a phonon-mediated superconductor, this leads to a critical temperature 
or zero-temperature gap given by 

T c ~ A(T = 0) ~ uj d e -2^F/|Vb| (12.30) 



Bardeen, Cooper, and Schrieffer found the paired ground state using a vari- 
ational ansatz. A more general formalism, which can be used when the in- 
teractions are retarded was pioneered by Nambu, Gor'kov, and Eliashberg. 
Both of these approaches owe their success to the kinematic constraints of 
the problem. There are no relevant interactions other than F(p,\,£l2) an d 
V(Qi,Q,3). .F(fii,Jl2) does not contribute to the running of V(£li,£l3), so 
the diagram of Figure ?? is essentially the only diagram which must be 
taken into account. BCS theory and its refinement by Nambu, Gor'kov, 
and Eliashberg are mean-field theories which evaluate this diagram self- 
consistently. These theories will be discussed in chapter 16. 
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12 A 1/N and All Loops 

The one-loop structure of a system of interacting fermions is actually stable 
to all orders in perturbation theory. The essential reason for this (which 
was first recognized in this language by Shankar) is that A/kp is a small 
parameter like 1/N. To see this, consider the case D = 2. Break the 
angular integration into pieces, A9 = A/kp with 9j = 2irj(A/kp) and j = 
0, 1, . . . , k F /A. 

de^y" de (i2.3i) 

JO i^Oi 

Then, we can write 

72 



5 = E/ tct^^(^-m)^ 

i 

Ef de\ de-2 de 3 (Pk\ d 2 k2 d?k 3 d 2 k^ 
J ~2tt ~2tt ~2~k ~2tt 2tt 2tt 2vT 



Fij i/>j(fa, £3)^(^2, £2) ipiih, £4)^(^1, ei) ^ (ki + k 2 -k 3 - k^j 

Ef de\ dt2 de 3 d 2 k\ d 2 k2 d 2 k 3 d 2 k^ 
J 2tT 2tT ~2tt ~27r 2tt 2tt 2tT X 

Vij fplifo, tz)4>\ k F (h, £4) ipi(.k 2 , e 2 )ip,,k F (fci, ei) 5 ( fci + k 2 - k 3 -f-12 4 3)2) 



We have broken the angular integral into a summation over Fermi surface 
'patches' and an integral over each patch. Hence, F{9i—9j) has been replaced 
by F^. By restricting to F^ rather than allowing Uijki, we have automat- 
ically restricted to nearly forward scattering - i.e. to scattering from one 
point to another within the same patch. Furthermore, the (5-function, 

5 (jfei + k 2 - k 3 - fc 4 ) (12.33) 

does not contain any momenta of 0(kp); the fcj's live within patches and, 
therefore, are all less than the cutoff. 
In this expression, 

A f-2wA/k f 



d 2 k = dl de 

J-A Jo 



A i-kA 

dk ± / dh\ (12.34) 

A J-ttA 
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so both momenta have cutoff ~ A. 

Hence, if we rescale all momenta by A and the field, ip, as well: 

to 

to — > — 

A 

ip -► A 2 V> (12.35) 

we can rewrite the action as: 

k F /A 



£ / (2^^^ t(k_M) ^ 



i=0 

E^ A /" dei de 2 de 3 d' 2 ki d 2 k 2 d 2 k 3 & 2 k± 

I ~2tt ~2tt ~2tt ~27r 2tt 2tt 2tT * 

1,3=0 ' 



Fij ipj(k 3 , e 3 )ipj(k 2 , e 2 ) ^1(^4, £4)^(^1, ei) $ (ki + k 2 -k 3 - k^j 



kF 

e'/ 

Vij ^(&3, e 3 )ip* k (k^ e 4 ) ^(^2, e 2 )tp kp (ki,ei) 5[k 1 + k 2 -k 3 -fll 4 36) 



fc ^-f" /" cZei de 2 (ie 3 <i 2 A;i d 2 k 2 d 2 k 3 d 2 ki 
' ' ^^T^T 2tt 2tt 2tt 2tt 



ij'=o 



A 

In other words, if we write N = kp/A, 
V ' d 2 k de 

i=0 

Ef de\de 2 de 3 d 2 k\d 2 k 2 d 2 k 3 d 2 k^ 
. „ / 2tT 27r" 27r" ~2tt 2tt 2tt 2tT * 

1,3=0' 



S = t-J Wf^ i{ie ~ VFl)i)i 

N 

— ^j(fc 3 , e 3 )ipj(k 2 , e 2 ) ^1(^4, £4)^(^1, ei) ^ (&1 + ^2 - £3 - £4) 

Ef de\de 2 de 3 d 2 k\d 2 k 2 d 2 k 3 d 2 ki 
J ~2tt ~2tt ~2tt 2tt 2tt 2tT X 

»,j=o 

— Vij^\{k^ez)il}\ k (fc 4 ,e 4 ) ipi{k 2 ,e 2 )ip kp (fci,ei)d (fci + fc 2 - £3 -flS 4 37) 

iV J 2A V y 



then we see that we have a model in the large A" limit. 

Recall the analysis of the 0(N) model in the large N limit. The only 
0(1) corrections to the two-point function are diagrams of the form of figure 
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??a. These shift the chemical potential. The non-trivial diagrams such as 
??b are 0(1/N). Consider now the correction to the four-point function. 
Only diagrams such as those of ??a,b are 0(1/N). In the case of forward 
scattering, ??a vanishes because both poles are on the same side of the axis. 
In the case of Cooper scattering, ??b, gives a non-trivial contribution. The 
other corrections, such as those of ??c are 0(1/ N 2 ). 

As we learned earlier in the context of 0(N) models, the large- N limit 
introduces the following simplifications. The only diagrams which need to 
be considered are the bubble diagrams. The one-loop RG is the full story. 
Consequently, the action (12.37) is a stable fixed point if Vij > 0. A system 
of fermions which is controlled by this fixed point is called a Fermi liquid. 



12.5 Quartic Interactions for A Finite 

The scaling of generic four-fermi interactions is quite awkward for calcula- 
tions at a finite frequency or temperature scale because the u's don't scale 
continuously. Thus, the scaling of a physical quantity which depends on the 
u's is determined not by the scaling of the it's, which is marginal, but on 
the number of non-zero u 's, which is scale dependent (except in the impor- 
tant case where the quantity is determined by forward, exchange, or Cooper 
scattering - which do scale continuously). For such calculations, a different 
scaling transformation is useful. Suppose A is small but finite. Then, in two 
dimensions, we can consider nearly forward scatering, from k±, &2 to k%, k± 
with \k\ — k%\ < A, \k2 — ki\ < A. Since all of the action is taking place in the 
neighborhods of fii, f^, we focus on these points. We construct cartesian 
coordinates, k x (tangent to the Fermi surface) and k y (perpendicular to the 
Fermi surface), at these two points on the Fermi surface. In the vicinity of 
these points, 

e(k x ,ky) = v F (ky + ^j (12.38) 

We now scale to fii, Cl2, using the scaling k y — ► sk y , k x — ► s l l 2 k x , 
co — > suj. (We have assumed d = 2; in d > 2, there are d — 1 momenta 
which scale as k x .) The same answers are obtained with either scaling 
transformation; it's just that some calculations are easier with this one. 
On the other hand, it's a less natural renormalization group transformation 
because it involves selecting preferred points on the Fermi surface and scaling 
differently at different points on the Fermi surface. Let's briefly see how this 
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Figure 12.5: A one-loop self-energy correction, 
works. The quadratic part of the Lagrangian is of the form: 

S = J dto dk y dk x {^ (^iuj - v F (k y + V'} (12.39) 

Hence, the field now scales as ip — > s~ 7 / 4 ip, so four-fermi interactions, 
S4 = J dwidw2du3d 2 kid 2 k2d 2 k%u{ki,k2,k3)^\k4 L ,u 

(12.40) 

scale as s 1 / 2 .The scaling is perfectly continuous. If k\, fc 2 , k$, k± = k\+k 2 — k^ 
lie within the cutoff A, then they continue to do so under this renormalization 
group transformation. If we insert a 5(k\ x — k^ x ) or 8{k\ x — k±x) into the 
integrand, then we get a marginal interaction, namely forward scattering, as 
before. 5 To see why this is a useful scaling, consider the diagram in figure 
12.5. It has a real part, proportional to F 2 u> which comes from the marginal 
forward scattering interaction, and an imaginary part, proportional to F 2 fU> 2 
coming from irrelevant non-forward processes, in agreement with the explicit 
calculation which we did in chapter 13. The above scaling immediately yields 
the suppression of F 2 j with respect to F 2 by one power of u, a result which 
is more cumbersome to derive with the other RG transformation. 

12.6 Zero Sound, Compressibility, Effective Mass 

As a result of the preceeding analysis, the density-density correlation func- 
tion can be computed by summing bubble diagrams. Other diagrams are 
down by powers of l/N. The approximation which consists of neglecting 
these other diagrams is called the RPA, or random-phase approximation 
(for historical reasons). The bubble diagrams form a geometric series which 
may be summed. Let us consider the simplest case, in which F is a constant, 
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Figure 12.6: The geometric series of bubble diagrams which determine 
(p(q,iu) p(-q,-iu)) to 0(1). 



(p(q, iu) p(-q, -iu)) = I(q, iuj) + (I(q, iu)) 2 F + ... + (I(q, iu;)) n+1 F< 

I(q,iu) 



1 - I(q, itu)F Q 



(12.41) 



where I is the value of a single particle-hole bubble. In the limit of q <C /cp, 
this is: 

de d 3 k 



/de d k 
2k (2tt)3 ^ ^ + ^ + ^ 



2 I de dldip 



1 



2-7T (2-7r) 3 v y ie — vfI ie + iuj — vfI — VFq cos ( 

dl n 9(1) -9(1 + q cos 9) 
2 a(cos 0) — 



(2tt)* 

^ /d(00B(?) 



— t^gcosi 

Q COS 



hp 



. dx — 

Ait 2 vfJ--\ —-x 



iuj — VFq cost 

X 



2tt 2 



Vf 



1 ico ^ / iuj + f_Fg 

2 vj?g \zw — VFq 



- 1 



(12.42) 



The retarded density-density correlation function is a response function, 
— Xpp, of the type which we discussed in chapter 7. If we imagine changing 
the chemical potential by a frequency- and wavevector-dependent amount, 



12.6. ZERO SOUND, COMPRESSIBILITY, EFFECTIVE MASS 



219 



Sfx(uj,q), then the action changes by 

Hence, following the steps of chapter 7, we have 

(6p(u, q)} = x PP (u, q) Sfj,(u, q) 



(12.43) 



(12.44) 



Since it reflects the density change resulting from a variation of the chemical 
potential, it is called the compressibility. As usual, a pole of Xpp{li ^) on 
the real axis is a propagating mode. According to (12.41), there is such a 
pole when: 



1 

y 



kp 



2tt 2 vf 



1 w 



2 VFq 



In 



or, 



1 



kp 



2it 2 vf 



s In 



uj + VFq 
oj — VFq 



s+1 



1 



1 



1 



(12.45) 



(12.46) 



where s = uj/vFq The solution of this equation occurs for s > 1, i.e. there is 
a mode with go = svpq- In other words, this mode, called zero sound, has a 
velocity of propagation, svp, which is greater than the Fermi velocity. Figure 
12.7 shows the allowed phase space for a particle-hole pair. The continuum 
of states composed of a particle- hole pair lies beneath the line u> = VFq 
for q small. Since the zero-sound mode lies outside this continuum for q 
small, it cannot decay; this explains why it is a propagating mode. Energy 
and momentum conservation do allow it to decay into multiple particle-hole 
pairs, but the interactions which would allow such decay are six-fermion and 
higher interactions which are highly irrelevant. 
According to (12.41), 



x°p P (q,u) 



l + F oX ° pp (q,Lu) 



(12.47) 



where x^ p is the compressibility in the absence of interactions. Let us con- 
sider the static compressibility, Xppil ~^ 0,0), for both the interacting and 
non-interacting systems. From (12.42), 



x° PP (q 



0,0) 



2tt 2 vf 
nfkp 



(12.48) 
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Figure 12.7: The allowed uj, q values of the the zero sound mode and the 
continuum of particle-hole excitations. 



where m* = kp/vF- Hence, 

x PP {q 0,0) 



X°p(<7-0,0) 1 + J^ Fq 

2w z vf u 



(12.49) 



The compressibility is decreased by interactions if we assume that m* is the 
same in both the interacting and non-interacting systems. However, this is 
usually not the case. 

Consider the behavior of a Fermi liquid under a Galilean boost by 8v = 
6p/m. The kinetic term in the action transforms, but the potential energy 
is invariant. 

S = j drd d x (V (d T - ^ + ^(x)ip{x)V{x-x')ip\x')ip{x')^ 

(12.50) 

Similarly for the Hamiltonian, 

H = j d d x (V (Q^ -A *)) ^ + ^ t ( f )^( f ) y ( f - f ')V' t (^)V'(^)) 

(12.51) 

so the energy transforms as: 

5E = P- 5p/m (12.52) 
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where P is the total momentum. If we consider a state with a filled Fermi 
sea - which has momentum zero - and a quasiparticle at the Fermi energy, 
and we boost the system in the direction of the quasiparticle's momentum, 
then 

SE = k F 5p/m (12.53) 

On the other hand, we can compute the energy change using Fermi liquid 
theory. The boost shifts the quasiparticle momentum by bp and also moves 
the Fermi sea by this amount. This doesn't affect its momentum to lowest 
order in 6p, but it does change ipjtpi by 5p cos 0j. Hence, the energy shift of 
this state is also 

cos o 

= v F Sp + 5p\Fi (12.54) 
Hence, comparing these two expressions and using m* = &f/Vf, we have 

(12.55) 

Consequently, the ratio of the interacting and free compressibilities is: 
X PP {q^ 0,0) _ 1 + 3 (sfe) 



m 
m 



3 \2ir 2 v F J 



xgT(ff-»0,0) l + _^F 



(12.56) 



a ratio which depends on the relative strengths of Fq and F\. 
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CHAPTER 13 



Electrons and Coulomb Interactions 



13.1 Ground State 

Thus far, we have assumed that there are only short-range interactions be- 
tween the fermions in our system. This assumption is appropriate for 3 He, 
but not for electrons in metals which interact through the Coulomb interac- 
tion, V(r) = ne 2 jr. When the Coulomb interaction energy is large compared 
to the kinetic energy, we expect the electrons to form a Wigner crystal. If, 
on the other hand, the Coulomb energy is small compared to the kinetic 
energy, we expect the electrons to form some kind of liquid state; later in 
this chapter, we will show that this liquid is a Fermi liquid. 

A naive comparison of these energies estimates the kinetic energy by 
the kinetic energy of a free Fermi gas and the interaction energy from the 
average Coulomb energy of a system of electrons at that density: 

^Coulomb = (cQnst) ^ (131) 
-C'Kinetic 

where r s is ratio of the interparticle spacing to the effective Bohr radius in 
the metal 



i 

3 V 



Ann 



(13.2) 



and ao = 1/mne 2 is the effective Bohr radius in the metal. Stated differently, 
charge e is enclosed within a sphere of radius r s ao. r s is the controlling 
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Figure 13.1: The graphical representation of Coulomb interactions. 

parameter for many of the approximations which we make in this chapter. 
r s is small in the high-density limit where we expect a Fermi liquid and large 
in the low-density limit where we expect Wigner crystallization. 

Better estimates of the Wigner crystal and electron liquid energies can 
be obtained in the low- and high-density limits for a model of electrons in 
a fixed uniform background of positive charge (the jellium model) . In the 
Wigner crystal state, this can be estimated to be: 

SI „ ™» _ H (13 .3) 

n rj r s 

in units of the Rydberg, 13.6eF. 

In the liquid state, this can be computed perturbatively from the La- 
grangian: 



S = J dr J d 3 k l^^+i-jc^t^^ 

+ J dr j d 3 kd 3 k'd 3 q (V (k + q)ip(k) - n5(q)^j 



47TKe 



' - q)iP(k') - n8(qj) (13.4) 



The Coulomb interaction is represented by a dotted line, as in figure 13.1 

If we expand the ground state energy perturbatively, the zeroth-order 
term is the kinetic energy. The first-order terms come from diagrams of (a) 
and (b) of figure 13.2. The first - or Hartree - term vanishes as a result of 
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b) 




Figure 13.2: The (a) Hartree and (b) Fock contributions to the ground state 
energy of the electron gas. 

the neutralizing background (i.e. the nd(q)). The second - or Fock - term is 
non-vanishing. In the Hartree-Fock approximation, the ground state energy 
is given by: 



2.2099 0.9163 



(13.5) 



where the first term is the kinetic energy and the second term is the exchange 
energy. 

The next terms in the expansion in r s come from summing the diagrams 
of figure 13.3. The first term in this series is infrared divergent, but the sum 
is convergent: 



Ei 



2.2099 0.9163 



0.094 + 0.0622 In r s 



(13.6) 



This is the sum over bubble diagrams - the Random Phase Approximation - 
which we encountered in the small A/kF approximation for a Fermi liquid. 
In this context, it is justified for a calculation of the ground state energy in 
the small r s limit since the neglected diagrams give contributions of 0(r s ). 

For r s large, the ground state of the electron gas is the Wigner crystal. 
For r s small, it is the liquid state, the nature of which we discuss in this 
chapter. 



13.2 Screening 



In the presence of Coulomb interactions, naive perturbation theory is in- 
frared divergent because the interaction V(q) = 47TKe 2 /g 2 (unless otherwise 
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O 



Figure 13.3: The RPA contributions to the ground state energy of the elec- 
tron gas. 

specified, we work in d = 3 in this chapter) is singular in the ^ — > limit. 
In the language of the last chapter, we cannot divide the Fermi surface into 
./V patches and justify Fermi liquid theory in the large TV limit because the 
interaction V(q) = Anne 2 /q 2 is singular within a single patch when q — > 0. 

However, the 'bare' Coulomb interaction, V(q) = 4irne 2 /q 2 , is not the 
actual interaction between two electrons. In fact, the interaction between 
any two electrons will be far weaker because all of the other electrons will 
act to screen the Coulomb interaction. The correct strategy for dealing 
with electrons with Coulomb interactions is to do perturbation theory in 
the screened Coulomb interaction. This can be done systematically, as we 
show in the next two sections. 

First, however, we recall the Thomas-Fermi model, a simple model for 
screening which illustrates the basic physics in the low q,u> limit. To un- 
derstand the physics at q — ► 2k f, we'll have to use more sophisticated ap- 
proximations such as the RPA. Let us imagine that we have test charges 
described by p cxt . In a metal, they will induce a charge distribution p inc j. 
According to the Laplace equation 



k is the dielectric constant due to the ions and the core electrons. In the 
Thomas-Fermi aproximation, we assume that (j) is very slowly-varying so 
that we can make the approximation that the local density is given by n(fx + 



-V 2 (f> = 471-pcxt + 47rp ind 



(13.7) 



K 



e</>(r)): 



And(0 « -e (n(fi + ecj)(r)) - n{jj)) 
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Then, 



1 9 9 dn 



e 2 |%(r) (13. 



g 2 + 47re^— U = 47rp ext (13.9) 

K OjJ,/ 

In other words, the bare Coulomb interaction has been replaced by a screened 
Coulomb interaction: 



Anne 2 AnKe 2 



q 2 q 2 + kl 

or 

1 e~ kor 



r r 

where ko is the inverse of the Thomas-Fermi screening length, 

i. 



(13.10) 



(13.11) 



ko= [ 47TKe dJl) (13 ' 12) 
For a free Fermi gas, fi = (37rn) 2//3 /2m, so the screening length is 



/ , \ l/3\ 2 



1 

k ° L = {i{i) ) a ° r ° (13 - 13) 

When r s is small, i.e. when the density is large, the screening length is 
short and the Coulomb interaction is effectively screened. When this is 
true, we expect the potential to be slowly-varying and the Thomas-Fermi 
approximaton to be reasonable. When r s is large, however, the screening 
length is large and we do not expect the Thomas-Fermi approximation to 
be valid. 

A more refined result may be obtained by by replacing the bare Coulomb 
interaction of figure 13.1 by the sum of the diagrams of figure ??. Restricting 
attention to the sum of bubble diagrams is, again, the RPA approximation. 
The effective interaction, V^f A (q, cj), is: 

V c Y A (q, u>) = V{q) + V(q) I(q, u>) V(q) + V(q) I(q, u) V(q) I(q, u) V(q) + ... 

= V[q \ „ (13-14) 

l-I(q,u;)V(q) 1 ' 
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where I(q,u) is the particle-hole bubble which we evaluated in the last 
chapter. For q small, 

V(q) 



k! pa M) 



l + ^V(q) 

ATTKe 2 A nm*k F 

— ^ h 47TKe - „ 

q A ZTT^ 

Anne 2 



which is the same asthe Thomas-Fermi result. However, for the RPA 

result contains additional information about the dynamics of the electrons. 
Also, for q — > 2k F , the RPA result contains information about the Fermi 
surface. Of course, it is not clear why we can restrict attention to the 
sum of bubble diagrams. As we will see below, this sum gives the leading 
contribution in r s in the limit of small co,q. For w / and q — ► 2k f, the 
RPA approximation can be called into question. 



13.3 The Plasmon 

Although Coulomb interactions are ultimately screened and therefore allow 
a Fermi liquid treatment, there are non-trivial differences with the case of 
short-range interactions. The zero-sound mode no longer has linear disper- 
sion, oj = v s q. This may be seen at a classical level from Maxwell's equations 
together with the continuity equation. 

-V 2 (f) = 47rp 

K 

m — = ne Ve> 
dt 

i + ^'=° < 13 - 16 > 

Combining these equations for a longitudinal disturbance, j = \j\q/\q\, we 
have 

+ J P(9, w) = (13.17) 

Hence, the frequency of a longitudinal density modulation is the plasma 
frequency, uj p 

( 4irnne 2 \ 1 . 
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rather than the gapless dispersion of zero sound in a neutral Fermi liquid. 

The same result may be seen by, again, considering the RPA sum of 
bubble diagrams which determines the density-density correlation fucntion. 
The Landau parameter, Fq, is replaced in (12.41) by the Coulomb interac- 
tion, V(q). Consequently, the pole in this correlation function now occurs 
at: 



hp 



V{q) 2ir 2 v F 



1 _UJ_ 

2 VFq 



In 



lo + VFq 
to — VFq 



1 



(13.19) 



On the right-hand-side, take the q — ► limit with q <C u. 



hp 



Airne 2 2tt 2 vf 



hp 



UJ 
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2 VFq 



, VFQ 



, ( VFQ\ 2 

3 2tt 2 vf \ a) / 



+f(^r+-) 



or, 



uj 2 = Anne 2 



VFkp 

6tt 2 



(13.20) 



(13.21) 



which is the same as (13.18). 

Since V(q) — ► oo as q — > 0, (p(q,uj) p(—q,—u)) — > in this limit. One 
might be tempted to conclude that the compressibility of the electron gas 
vanishes. However, the density-density correlation function gives the com- 
pressibility in response to the applied field: 



${p(q,u)) = (p(q,uj) p(-q,-u)) 54> cxt (q,uj) 
In linear response, <j>md(q,w) is given by, 

Airne 2 



q 



2 Xp P (q,v)<t>(.q,u) 



(13.22) 



(13.23) 



Hence, 



47TKe 2 



S (p(q, u)) = (p(q, u) p(-q, -co)) 1 + X ° (q, u) 6<f>(q, u) (13.24) 



so the compressibility is finite as q — > 0. 

In this section, we will show, following Bohm and Pines, how to separate 
the plasma oscillation from the rest of the degrees of freedom of an electronic 
system. When this is done, the remaining electronic degrees of freedom in- 
teract through a short-ranged, screened Coulomb interaction. Essentially, 
gauge invariance tells us that longitudinal photons - whose exchange gives 
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rise to the Coulomb interaction - and density fluctuations are not distinct 
objects. When long- wavelength density fluctuations aquire a mass gap as a 
result of their self-interaction, they (and the longitudinal photons to which 
they are equivalent) can no longer propagate over long-distances. Conse- 
quently, the Coulomb interaction becomes short-ranged. 

To exhibit this clearly, we make the following manipulations: 

• Electrons with Coulomb Interactions. We begin with the action of a 
system of electrons with Coulomb interactions: 



S = J dr J d 3 k d T i/j + ^k 2 Y* 1 '(k)ip(k) 

dT J d 3 kd 3 k'd 3 q^(k + q)^(k)—^^(k'-q)ii(M^5) 



Electrons interacting with Longitudinal Photons. The long-range 1/q 2 
interaction results from integrating out the longitudinal part of the 
electromagnetic field. We could equivalently write this as 

S = J dT J d 3 k (^d T ^ + e^A ^ + ^-^(k)(k + eiy ^(k)\ 

+ [ dT [ d 3 k—E(k)E(-k) (13.26) 

J J 8lTK 

The magnetic part of the electromagnetic action has been dropped 
since we assume that all velocities are much smaller than the speed 
of light; we keep only the longitudinal modes of the electromagnetic 
field. Equation (13.25) is obtained from (13.26) by integrating out the 
electromagnetic field. To do this, we will choose Coulomb gauge, Aq = 
0. In doing so we must, however, impose the Gauss' law constraint 
which is the Aq equation of motion. 

S = J dT J d 3 k (^d T ^ + ^i^(k)(k + eXy^(k)\ 

+ / dT [ d 3 k—(d T A(k)) 2 + — !— A (v ■ E — AnrKetjj^pn) 
J J 8ttk Attk V / 

Note that A(k) is a scalar field because it is only the longitudinal part 
of the electromagnetic field - which not independent of the density 
fluctuations of the electrons. The real dynamics of the electromagnetic 
field is in its transverse components, which do not enter here. If we 
were to integrate out A(k), then, since A(k) is gapless at tree- level, 
we would get Coulomb interactions between electrons. However, a 
tree-level analysis misses the fact that A(k) is, in fact, not gapless. 
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Integrate out Short-Wavelength Photons. Instead of integrating out 
A(k) fully, let us instead only integrate out those modes of A(k) with 
k > Q for some Q. Bohm and Pines did this at the Hamiltonian level, 
by applying a canonical transformation of the form: 



U = e 



-if£ d Z qa (q)V^ fd?ktf(k+qMk) (13.28) 



to the Hamiltonian corresponding to (13.27). A is the upper cutoff 
and Q is a wavevector to be determined. 

Then, we obtain an action of the form: 



S = J (It J d 3 k ($d T il> + ^^(k)(k + eiy 4>(k) 

+ J dT^^q (J-(d T A(q)) 2 + -±-Aq (v • E(q) - 4™/>V) 
dr J d 3 k d 3 k' J d 3 q^(k + q)^{k) ^ (k' - q)i/>(tfy29) 



Notice that the four-fermion interaction is now short-ranged since it 
is restricted to \q\ > Q. 

• Isolate the term which gives a gap to long-wavelength Photons. We 
now expand (k + eA) 2 : 



S = j dr j d 3 k ^ f W+|:^(Jfe)#) 



2m 

+ j dr j d 3 kd 3 k' j d 3 qtp Jt (k + q)i>(k) i/J(k' - q)tp(k') 

+ j dr j d 3 kj^d 3 q^ (^+f) ■A(-q)^(k + q)iP(k) 

r r rQ rQ 2 

+ J dr J d 3 kj d 3 q J d 3 q' ^-A{q)A(q') ^ (k - q - q')^{k) 

+ j d ^J^d 3 q (^-\d T A(q)\ 2 + -^-A (V • E{q) - 47r/«e^^0 



We split the third line into a part which comes from the average den- 
sity, n, and a part resulting from fluctuations in the density: 

dr J d 3 k J d 3 q J d 3 q' ±-A(q)Atf) ^ (k - q - q')^(k) = 
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/fQ fQ 2 / r 

dr I d 3 q / d 3 q' ^-A{q)A{q') ( / d 3 k^(k -q- q')ip(k) - nS(q + q') 

(13.31) 

the first term on the right-hand-side can be combined with the \d T A{q)\ 2 
term to give Sp in the action: 

S = S FL + S P + S Int + S c (13.32) 

with 

Sfl = j dT j d 3 k U^d T ^ + ^^{k)^{k)\ 

+ J 'dr f d 3 kd 3 k' J A d 3 q^ {k + q)^{k)^^ ^ {k' - q)^{k') 
Sp= sLI ^ly^ {\ d rM<l)\ 2 + "l\A{q)\ 2 ) 
Smt = J dT j d3k j o (k + ^YA(-q)^(k + q)^(k) 

+ J dT J d 3 qj d 3 q' ^-A(q)A(q) i^J d 3 k^(k -q- q')ip(k) - n5{q + q) 



Q 



o 



1 
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S c = dT d 3 q A) V • E{q) - inne^ij (13.33) 



S'fl is the action of electrons with short-range interactions. Sp is the 
action of plasmon modes A{q) with \q\ < Q; these modes have fre- 
quency u p . If we were to integrate them out now, they would mediate 
a short-range interaction, not the long-range Coulomb interaction. Sint 
describes the interaction between electrons and plasmons. Sc imposes 
the constraints which eliminate the additional degrees of freedom in- 
troduced with the plasmons; these degrees of freedom are not gauge 
invariant and are, therefore, unphysical. 

By separating the plasmon from the other electronic degrees of free- 
dom, we have obtained a theory of electrons with short-range interac- 
tions. The basic physics is already clear from (13.32). However, we 
are not yet in a position to make quantitative predictions. The inter- 
action depends on a free parameter, Q, and is not the Thomas-Fermi 
interaction in the cj, q — > limit. To understand the electron gas at a 
quantitative level, we must consider Si nt and Sc- 
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13.4 RPA 

The conclusions which we drew at the end of the previous section were based 
on a neglect of Sint and Sc- In this section, we consider S\ nt and the RPA 
approximation which simplifies it. We have used the term RPA in several 
contexts. The definition of the RPA is the following. We neglect the coupling 
between p(q) and p(<?) if q / <f ' ■ In the computation of a correlation function 
at q, we only consider diagrams in which the dotted Coulomb interaction 
line carries momentum q. In other words, V{c?) does not appear in these 
diagrams unless q = if . The RPA is justified in the limit of small r s and the 
limit q — ► 0. For the density-density response function or the ground state 
energy, this amounts to keeping only the bubble diagrams and neglecting 
other diagrams. 

The first step is to choose a Q which optimizes Sfl + <Sp , thereby making 
the effect of Si n t as small as possible. Without proof, we state that we can 
minimize the energy of the ground state of Sfl + Sp (computed to lowest 
order in the screened Coulomb interaction) by taking 

Q^k F (jY (13.34) 

Physicaly, Q must be finite since, for q large, the plasmon mixes with the 
particle-hole continuum and is no longer a well-defined mode. 

We now make the Random Phase Approximation, or RPA and completely 
neglect the term: 

dr / d 3 q / d 3 q' —A{q)A{q') ( / d 3 k^(k - q - q')^(k) - nS(q + q') 

(13.35) 

in Si n t. To justify the RPA, consider the effect of this term on the ground 
state energy. It shifts this energy by 

3 



(jY Q 3 u> p (13.36) 



Hence, the random phase approximation is valid in the small r s limit since 
the energy shift is small compared to the plasmon zero-point energy. 
We are now left with the action 



Srpa = j dr J d 3 k (y^ + l^v^)^) 
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+ J dr J d 3 kd 3 k' J d 3 q^{k + q)^{k)—^-^{k'-q)^{k') 

+ ^ / dT jy ,( l (\d T A{q)\ 2 +ul\A(q)\ 2 ) 

+ J dr J d 3 kj^d 3 q^- (&+f) ■ A(-q)1>\k + q)1>(k) 

+ / dT lo ^ q ' ^ ~ 47rKe ^)) (13.37) 

If we could ignore the last line, we would have a theory of electrons with 
short-range interactions together with gapped plasmons. At frequencies or 
temperatures much less than uj p , we can ignore the plasmons, so we would 
have a Fermi liquid. However, the constraint cannot be ignored. Treating 
the electrons and plasmons as fully independent would be a double-counting 
of the degrees of freedom of the system. What we can do, instead, is de- 
couple the plasmon from the electrons. When this is done, the constraint 
will only involve the particles. If we ignore the constraint - which is now a 
constraint on the electrons alone - then we can apply Fermi liquid theory 
to the electronic action. Fermi liquid theory (as we saw in the last chap- 
ter) instructs us to compute only bubble diagrams to obtain the screened 
Coulomb interaction. 



13.5 Fermi Liquid Theory for the Electron Gas 

Following Bohm and Pines, we now perform a canonical transformation, 



O -► e~ iS Oe iS 

Ix^e-^lx) (13.38) 



generated by S: 



L ' I uj(q, ip, ift) — q- k + q 2 /2m ^ 2 ^ 



m 



where 

('^(/Aff'C)*))^! (13.40) 



The principal results of this canonical transformation in the limit r s — > 

are: 
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• The elimination of the Aip^ip interaction between plasmons and elec- 
trons 

• The modification of the plasmon action to: 

SP= ^J dT iy Zq {\ d rMq)\ 2 +"(q^,^)\A(q)\ 2 ) (13.41) 

• The replacement of the cutoff Coulomb interaction by the RPA screened 
Coulomb interaction, 

M Attkp 2 f 

J d 3 q^(k+q^(k) —^-^{k'-q)^{k') - J d 3 q^(k+q)^k)V RPA {q)^{k'-q)^(k') 

(13.42) 

• The elimination of the plasmons from the contraints. The constraints 
now read: 

/ d 3 k _^ {k + q)m = o (13 . 43) 

J {u{q^^)f-(q-k-q*/2m) 

for \q\ < Q. 

Hence, we now have a theory of weakly-coupled electrons and plasmons. 
The electrons interact through a short-ranged interaction (which can be 
obtained by summing bubble diagrams). The contraints reduce the number 
of degrees of freedom of the electrons. For Q small, this is assumed to have 
a small effect on the electronic degrees of freedom. 
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CHAPTER 14 



Electron-Phonon Interaction 



14.1 Electron-Phonon Hamiltonian 

14.2 Feynman Rules 

14.3 Phonon Green Function 

14.4 Electron Green Function 

Let us consider the electron-phonon interaction, 



which couples electrons to transverse phonons. What effect does this have 
on the electron Green function? 

The one- loop electron self-energy is given by the diagrams of figure 14.1. 
The first diagram just shifts the chemical potential. At zero-temperature, 
the second diagram gives a contribution: 




(14.1) 



T,(ie,k) = g 2 



1 
I 



du d 3 q 



Q{ie — iuj, k — q) V(ito, q) 




2vr (2tt) 3 
dm d?q 



(14.2) 



2vr (2tt) 3 



ie — iuj — ik-q —u 2 — v 2 q 2 
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Figure 14.1: The one-loop diagrams contributing to the electron-self-energy. 

Closing the contour in the upper-half-plane, we pick up the pole at iuj = 
ie — £k-q if £,k-q > and the pole at iu = —vq: 



+ 



q 2 9(Hk- q ) 



-2qv ie + vq- £ fe _ g (j e - £ k _ q ) 2 - v ^ q 
Analytically continuing ico — ► co + id to obtain the retarded self-energy, 



(14.3) 



Rc£ 



ret 



(e,k)=g 2 J 



d 3 q 
(2^)3 



1 



+ 



q 2 0(Hk- q ) 



-2v e + vq- Ck- q (e - £ fe _ 



2 2 2 



14.4) 



At small e, dropping a constant shift in the chemical potential, we have: 



Re£ rct (e,fc) = g e j ^— 



d 3 q 



2£fc-<? Q 2 & (Ck-q) 



14.5) 



^{vq-ik-qY {ik-q 2 -V^Y\ 

We can take k — ► kp in this integral to obtain the leading behavior: 

ReS rct (e,fc) ~ 5 2 e (14.6) 

Meanwhile, 

ImS ret (e, k) = g 2 J ^ ± [- (1 - 9 (6-,)) 5 (e + vq- - 9 (&_,) «5 (e - ug - &_,)] 
= ^2 1 q 3 dq d(cos 0) [- (1 - 9 (&_,)) <5 (e + ug - ^-,) 



- # (Cfc- g ) ^ (e - vq - i k _ q) \ 
For e small, q ~ e, and terms in the 5 function can be dropped: 

ImS ret (e, k) = J q 3 dq d(cos 9) [-{1-9 (&_,)) 5 (e + (t> + v F cos 9) q) 



(14.7) 
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- {£ k -q) S(e-(v-v F cos 6) q)\ 
g 2 e 3 (14.8) 



14.5 Polarons 
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CHAPTER 15 



Rudiments of Conformal Field Theory 



15.1 Introduction 

In finite-temperature equilibrium statistical mechanical systems at their crit- 
ical points or quantum systems at T = with z = 1, scale invariance and 
rotational invariance (or pseudo-Lorentz invariance) give rise to the larger 
symmetry algebra of conformal transformations. In two or 1 + 1 dimensions, 
this symmetry algebra is infinite-dimensional. Thus, it places strong con- 
straints on the critical spectrum and correlation functions - much stronger 
than those due to, say, SU(2) symmetry. Some special conformal field the- 
ories - but, by no means, all - can be completely solved essentially through 
the representation theory of the conformal algebra (or, rather, its quantum 
version, the Virasoro algebra). Conformal field theory is a highly-developed 
subject which has taken on a life of its own, in part due to applications to 
string theory, so it is the subject of many full-length books and long review 
articles. Such an in-depth treatment would be out of place here. Rather, 
we would like to emphasize some of the key points, and use conformal field 
theory to illustrate some important concepts relating to critical points, on 
which this part of the book is focused. We also hope that our treatment of 
conformal field theory will help orient the reader who is interested in delving 
more deeply into the literature. 

The techniques which we will use in this section, which are primarily al- 
gebraic, are the first truly non-perturbative ones which we have encountered 



241 



242 CHAPTER 15. RUDIMENTS OF CONFORMAL FIELD THEORY 



so far in this book. Once interesting feature, which will recur in the non- 
perturbative methods discussed in later parts of the book, is that the analysis 
is most easily done in real space rather than momentum space. The advan- 
tages of momentum space for perturbation theory - the ease with which 
differential operators are inverted and the simple expression of translational 
invariance as momentum conservation - are outweighed by the advantages 
of real space which we will discover in the following pages. 

15.2 Conformal Invariance in 2D 

A conformal transformation is any coordinate transformation which only 
changes the metric by a scale factor. We will be working in flat Euclidean 
space or Lorentzian spacetime, with metric r]^ = diag{\, 1, . . . , 1) or rj^ = 
diag(—l, 1, . . . , 1), so this means that rj^ u transforms as rj^ — ► X(x) r]^ u . 

x^ 1 -> + e» (15.1) 



ds 2 = r ]f , u dx^dx u -► r]^ (dx fl + d a e»dx a ) (dx v + d p e v dx^ 

= ri^dx^dx" + {d„e v + d v e^) dx»dx v (15.2) 

Such a transformation will be a conformal transformation if 

{d^y + d v e^) oc rj^ (15.3) 

Comparing the traces of both sides, this can only be satisfied if 

2 

(df,e u + <%e M ) = - (d a e a ) rj^ (15.4) 

For d > 2, there is a finite-dimensional group of such transformations 
comprised of translations, rotations, scale transformations, and special con- 
formal transformations, which are inversion-translation- inversion combina- 
tions. In d = 2, this condition is simply the Cauchy-Riemann equations: 

die 2 = -d 2 e 1 

diei = <9 2 e 2 (15.5) 

If we write z,z = x 1 ± ix 2 , and e,e = e 1 ± ie 2 , then e is a holomorphic 
function of z, e = e(z), while e is anti-holomorphic, e = e(z). 

Writing our coordinates and fields in terms of holomorphic and anti- 
holomorphic quantities will prove to be such a powerful tool, that all of the 
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subsequent development will be carried out in real space - or, rather, in the 
extension of it to the two complex dimensional space (z,z) £ C 2 , which 
contains real space as a section, z = (z)* - rather than in momentum space 
where most of our previous discussion of field theory has taken place. 

This decoupling between holomorphic and anti-holomorphic - or right- 
and left-moving - degrees of freedom is a general feature of conformal field 
theory. Indeed, we can take this as a definition of a conformal field theory: 
a 2D quantum field theory which has correlation functions which decouple 
in this way. As we will see, this means that the correlation functions have 
simple scaling properties. Consider a free scalar field. Its equation of motion 
is d^d^ip = 0. We have been careful to write upper and lower indices 
because the metric tensor, which is used to lower indices, takes the form 
Vzz = rfzz = 0, r/zz = rfzz = \i in complex coordinates. Hence, the equation 
of motion takes the form d z c\(p = dd<p = 0, where we have introduced the 
notation d = d z , d = c\. Thus, <p is the sum of an arbitrary holomorphic 
function and an arbitrary anti-holomorphic function. 

The (classical) algebra of conformal transformations also decouples in 
this way. Consider the transformation z — > z — ez n+1 , where e is infinitesi- 
mal. Such a transformation is generated by the linear operator £ n = — z n+1 d: 



The analogous operator £ n generates transformations of the zs. 

By direct calculation, we see that the £ n s and £ n s generate two indepen- 
dent copies of the same algebra: 



Note, however, that these transformations are not all globally well- 
defined. In particular, £ n = —z n+l d is non-singular as z —> only for 
n > — 1. Making the transformation z = — 1/w in order to study the z — > oo 
limit, we find that £ n = —w 1 ~ n d w is non-singular asro^O only for n < 1. 
Hence, n = 0, ±1 are the only globally well-defined transformations. £q + £q 
generates scale transformations; i (£q — £q) generates rotations; £-±, £-\ gen- 
erate translations; and £±, £\ generate special conformal transformations. 



6f(z) = f(z-ez n+1 )-f(z) 

= f(z)-df-ez n+l -f{z) 
= -ez n+1 df 
= e£ n f 



(15.6) 




(m - n) £ m+n 
(m - n)£ m+n 




(15.7) 
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The general form of a transformation generated by these operators is 

az + b 

z — > 

cz + a 



(15.8) 



with ad — be = 1. This is the group SL(2,C)/Z2 (the Z 2 is modded out 
because a, 6, c, d — »■ —a, —6, — c, — d leaves the transformation unchanged). 

The special conformal transformations, z — > z/{az + 1) are somewhat 
less familiar than the others. Scale invariance is the defining property of a 
critical theory, and translational and rotational invariance are nice features 
which one would want most theories to have (at least in their continuum 
limits), but why special conformal transformations? As we will show in 
our discussion of the energy-momentum tensor, invariance under special 
conformal transformations is automatically a property of any theory which 
is scale and translationally invariant. 



15.3 Constraints on Correlation Functions 

Now, consider the transformation properties of the basic fields in a given 
theory. Primary fields generalize the notions of vectors and tensors under 
the rotation group to the conformal algebra. A primary field of weight (h, h) 
transforms as 

- (f ) k (f (/(*) ,700) (15-9) 

under a conformal transformation z — > f(z), z — ► f(z). 

Under a rotation z — ► e ld z, z — > e~ td z a primary field of weight h trans- 
forms as $ — ► e l ( ft ~ ft ) 6 '<j); under a scale transformation z — > Az, 2 — > Az it 
transforms as $ — > A ft-+fc( I ) . Thus, a scalar under rotation has h = h, while 
a vector has two components with h = 1, h = and /i = 0, /t = 1. 

Under an infinitesimal conformal transformation (on the holomorphic 
part of the theory; the anti- holomorphic part is analogous), a primary field 
transforms as 

8& h {z) = (1 + de) h $ h (z + e(z)) - $ h (z) 

= (hde + ed)<f> h (z) (15.10) 

Hence, the correlation function (^(zi)^^)) transforms as 



8 e ($i(zi$ 2 (z 2 )> = [(hid^) + e(zi)fr) + (/i 2 d 2 e(z 2 ) + e(z 2 )<9 2 )] (^(zO^zfe}) 
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In a field theory which is invariant under the group of conformal (global) 
transformations, the vacuum state will be invariant under translations, ro- 
tations, and special conformal transformations. As a result, correlations 
functions will be invariant under these transformations. The implication of 
translational invariance may be seen by substituting e(z) = 1 in (15.11): 

(di)+3 2 )($i(zi)$ 2 (z 2 )> = (15.12) 

In other words, the correlation function is a function of z± 2 = z\ — z 2 . 
Similarly, its anti-holomorphic dependence is only on zi 2 = zi — ~z 2 The 
implication of scale and rotational invariance may be seen by substituting 
e(z) = z in (15.11): 

+ hi) + (z 2 8 2 + h 2 )\ (^i(z 1 )^ 2 (z 2 )) = (15.13) 

Combining this with the analogous equation for the anti-holomorphic de- 
pendence of the correlation function, we see that the correlation function 
vanishes unless h± = h 2 and 

($ 1 ( Zu z 1 )* 2 (z 2 ,z 2 )) = ^h ( 15 - 14 ) 

Z 12 Z 12 

Thus, the two-point correlation function is reduced to single unknown con- 
stant. Since we can always redefine our fields in order to absorb this constant 
into their normalization, the two-point function is completely constrained. 

We can also apply the constraint of conformal invariance to the three- 
point function. We find that it must take the form 

(<M*l,*l)$2(*2,*2)^3,*3)) = H 1+h2 - h3 H 2 +S- hl H 1+h3 - h2 * (£LM5) 

z 12 z 23 Z 13 

Again, the correlation function is reduced to a single unknown constant 
(which remains unknown once the normalization is determined by the two- 
point function). 

However, when we turn to the four-point correlation function, we see 
that there is real freedom here, the cross ratio (or anharmonic ratio) x = 
^12^34/ ^13^24 is left invariant by the group of global conformal transforma- 
tions SL(2,C)/Z 2 . Hence, the four-point function takes the form: 

($1(2:1, Z!)c& 2 (z 2 ,z 2 )ci>3(z3, ^3)^3(^4,^4)) = f (x,x)l[z^ hiihj+h/3 X (£6*16) 

i>j 

where h = ^2jhj. In order to solve a conformal field theory, we must de- 
termine the numbers Cijk, the functions f (x,x), and their counterparts for 
higher correlation functions. 
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15 .4 Operator Product Expansion, Radial Quan- 
tization, Mode Expansions 

As we have already discussed, the operator product expansion expresses 
the notion that two nearby particles will appear to be a single composite 
particle when viewed from a great distance. In a 2D conformal field theory, 
the operator product expansion is written in the form: 

<t>i (z,z) fa (w,w) = J2 c vk (z - w) hk - h *- h > {z - wf^-^ fa (w,w) 

(15.17) 

The coefficients are precisely the same as appear in the three-point 
function. The leading term on the right-hand-side is the most singular one, 
i.e. the operator fa of lowest dimensions hk, hk- The right-hand-side can 
be thought of as somewhat like a Taylor expansion. In addition to a given 
operator fa, all of its derivatives dfa, d 2 fa will appear. The new element, 
compared to an ordinary Taylor expansion, is that other operators which 
are formally unrelated to fa will also appear because they are generated 
under renormalization by fa, fa. 

The operator product expansion is a very powerful tool when combined 
with contour integration. This is most nicely done if one considers a sys- 
tem with periodic boundary conditions, so that the spatial coordinate x is 
restricted to the interval [0, 2ir]. Then, if we perform a conformal transfor- 
mation to coordinates z = e T ~ tx , constant radius circle in the complex plane 
are constant time slices. Time-ordering is now radial ordering: 

Now, consider the transformation property of B(z) under a transforma- 
tion generated by A = § A(z')dz' /2iri. If A(z') is purely holomorphic, this 
doesn't depend on the specific circle along which we do the integral. Hence, 
the variation in B{z) is: 

/dz' 
— (A(z')B(z) - B(z)A(z')) 

= I ^-K(A(z')B(z)) (15.19) 
J c z 

where C\ is a contour encircling the origin at radius larger than \z\, C2 is a 
contour encircling the origin at radius smaller than \z\, and C z is a contour 
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encircling the point z. The latter contour integral is determined by the short- 
distance singularity of TZ(A(z')B(z)), or, in other words, by the operator 
product expansion of A(z') and B(z). One customarily drops the radial 
ordering symbol, which is understood, and simply writes A(z')B(z). Thus, 
the expansion which is useful to us is, in fact, and expansion of the radial 
product, not the ordinary product, but the notation hides this fact. 
Let us consider, as a simple example, a free scalar field, with action 



— [ dpd<f)d 2 x (15.20) 
2-ir J 



2vr 

According to the equation of motion, 

dd<p = (15.21) 

the field can be written as the sum of a holomorphic field and an antiholo- 
morphic field, (j)(z,z) = \ (f(z) + Tp(z)). The correlation functions of these 
fields take the form 

{4>(z, z)(p(w,w)} = — - In \z — w\ (15.22) 

or, in other words, 

(ip(z)<p(w)) = - ln(z - w) (15.23) 

with the analogous equation for Tp(z). Hence, the correlation function of dip 
is 

{d<p(z)d<p(w)) = - 1 (15.24) 

{Z W) 

if is a primary field of dimension h = 1, h = 0. 
According to Wick's theorem, 

dip(z)d(p(w) = (d(p(z)d(p(w)}+ : d(p(z)d(p(w) : 

= — 1 + : d<p(z)d<p{w) : (15.25) 

(z — wy 

(Recall that a radial ordering symbol is implicit.) Expanding dip(z) = 
dtp(w) + (z — w)d 2 ip{w) + . . ., we have 

d<p(z)d<p(w) = —- — - — ^2 + : d(p(w)d(p(w) : + [z — w) : d 2 ' (p(w)d(p^iS)26) 
(z — w) 

As we will see in the next section, : dtp(z)dip(z) := —2T(z), where T = T zz 
is a component of the energy-momentum tensor. Thus, 

dip(z)d<p(w) = -- — - 2T(w) - (z - w)dT(w) + ... (15.27) 

(z — wY 
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Hence, the operator product of dip(z) with dip{w) contains the identity op- 
erator, the energy-momentum operator, and its derivatives. This operator 
plays a special role in conformal field theories, as we will see in the next 
section. 

In free field theories, it is often useful to rewrite the field operators 
in terms of creation and annihilation operators, which are their Fourier 
modes. Indeed, this was the starting point of our development of quantum 
field theory way back in chapter 2. Very schematically, this looks like 4> = 
J2k a k etkx ~ UJt ■ I n radial quantization, z = e lx+T , so the expansion in modes 
is an expansion in powers of z. Since x £ [0, 2ir], the allowed momenta 
are simply the integers k = 0, ±1,±2, if we take periodic boundary 
conditions, i.e. only integer powers of z appear in the expansion. Consider 
the case of the primary field dip: 

dip(z) = ^nz- 11 - 1 (15.28) 

neZ 

The —1 in the exponent is a matter of convention whereby we separate 
explicitly the weight of a primary field. The operator product expansion 
of two operators is equivalent to the commutation relation between their 
modes, as may be seen using contour integration. We will see an explicit 
example of this two sections hence. 



15.5 Conservation Laws, Energy-Momentum Ten- 
sor, Ward Identities 

According to Nother's theorem (see chapter ), to each symmetry of the ac- 
tion, there is an associated conserved quantity. Invariance under the coordi- 
nate transformation x M — ► x M + e M is associated with the current = T^e^, 
satisfying the conservation law = 0, where T^ v is the energy-momentum 
tensor. Translational invariance implies that 

d^Tfu, = (15.29) 

since e M is constant for a translation. Scale invariance, for which e M = Xx^, 
implies that 

= d» (T^x") 
= {d»T lxv )x»+T lxv (d»x») 

= TH (15.30) 
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Hence, the energy-momentum tensor is both divergenceless and traceless. 

As an aside, we note that invariance under special conformal transfor- 
mations comes for free once we have scale and translational invariance. The 
associated conserved quantity is K^ v = x 2 T^ u — x ll x a T au : 

d v K^ = 0" (x 2 T^) - &> (x^x a T au ) 

— 2x T^ v -\- x d T^iu T av x^g T av x^x d T au 

— 2r u T — t) u r a T — r a av T 

= (15.31) 



Let us rewrite the trace-free and divergence-free conditions in complex 

notation, ds 2 = (dx 1 ) 2 + (dx 2 ) 2 = dzdz. Hence, g zz = g-zz = 0, g zz = g zz 
l 

2 



1 and, consequently, g zz = g zz = 2. Hence, 



T£ = 
9 z *T zz + g* z T zz = 

T zz = (15.32) 

In the last line, we have used the fact that the energy-momentum tensor 
is symmetric, T zz = T zz . Thus, there are only two non-zero components, 
T zz , Tzz- They are constrained by the divergencelessness of the energy- 
momentum tensor: 

g^T^ = 
g* z dT zz + g zz dT zz =0 

dTzz = (15.33) 

Similarly, dTzz = 0. Hence, T(z) = T zz and T(z) = Tzz are, respectively, 
holomorphic and antiholomorphic. 

At the quantum level, T(z) and T(z) must implement conformal trans- 
formations according to (15.19). Under a transformation z — > z + e(z), a 
field &(z) transforms as 

/dz 
— e(z)T(z)$(w) (15.34) 

(with radial ordering understood, as usual) At the same time, a primary 
field $ of weight h must, by definition, transform as: 

5<f> = hde<S> + ed$ (15.35) 

Hence, such a primary field must have the following operator product ex- 
pansion with the energy-momentum tensor: 

T(z) $(w) = - — ^— tt §(w) + — *— d§(w) + ... (15.36) 

(z — wy z — w 
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We can take this to be the defining relation for a primary field of weight h. 

Let's check that this does, indeed, hold for a free scalar field. The energy- 
momentum tensor for a free scalar field (with action normalized as in (15.20)) 
is: 

T{z) = -\ :d<p{z)dv{z) : (15.37) 

This expression has been normal ordered in order to make it well-defined. 
There are other ways of defining the energy-momentum tensor; this is the 
usual type of ambiguity which is encountered in quantizing a classical theory. 
The operator product of T(z) with d(p(w) can be computed using Wick's 
theorem. Either of the d<p(z) factors in T{z) can be contracted with d(p(w): 

T(z) d<p(w) = — i : dip(z) d<p(z) : d<p(w) 
1\ / 1 



1 d<p(w) + . . d 2 <p(w) (15.38) 



(z — w) 2 (z — w) 

The factor of 2 in the second line comes from the two different choices for 
contraction. This is, indeed, of the form (15.36), with h = 1. 

The quantum mechanical expression of the existence of a symmetry is 
the corresponding Ward identity which correlation functions satisfy. This 
can be derived by inserting the generator of the symmetry transformation 
into a correlation function and allowing it to act on all of the other fields in 
the correlation function. Consider the following contour integral 

dz \ 
—e{z)T(z)^ ( Wl ) ...<t> n K) j (15.39) 

The contour is taken to encircle all of the u>jS. This contour can be deformed 
into the sum of contour integrals along small circles encircling each of the 
wis. Thus, 



J2 (</>i M ■ ■ ■ jf ^i<z)T{z)<Pi (m) ■■■K K)^ (15.40) 

From (15.34) and (15.35), 
/ dz 

<f> —e(z)T(z)(f>i (wi) = e(wi)d(f>i (wt) + hide(wi)4>i (wi) (15.41) 
Jd 2m 
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Hence, 

/ dz 

i ^<z) ( - + -L- (0i (u/i) • . . n KD15-42) 

f-^ / c . 2m \{z- WiY z-Wi dwi J 

Since this holds for all e(z), we can drop the integral and simply write 

{T(z)4>i (toi) . . . <j) n (w n )} = 

n ( h 1 d \ 

£ (ens? + 7^ ^) («.) • ■ ■ *. (»«» 15 -«> 

15.6 Virasoro Algebra, Central Charge 

Let's now consider the OPE of the energy-momentum tensor with itself. If 
T(z) is a primary field, then this OPE will take the form (15.36) with h = 2. 
The calculation can be done explicitly in the case of a free scalar field: 

T(z)T(w) = d<p(z)d<p(z) !) d<p(w)d<p(w) : 

If 1 \ 2 1 1 

4\ (z-w) 2 J 4 (z-w) 2 

1/2 + , 2 , 2 T( W ) + 1 - &T(w) + ... (15.44) 



(z — w) A (z — w) 2 (z — w) 

The first term on the second line comes from contracting both dips in T(z) 
with both of them in T(w), which can be done in two different ways. The 
second term comes from contracting a single dip in T(z) with a single one 
in T(w), which can be done in four different ways. 

This is almost what we would get if T(z) were primary, but not quite. 
The leading term on the right-hand-side of the OPE prevents the energy- 
momentum tensor from being primary. A term of this form scales in the 
correct way, so it is allowed in general: 

T(z) T(w) = j^—, + T(w) + — dT(w) + ... (15.45) 

\z — u>) 4 (z — w) z z — w 

Similarly, 

T(z) T(w) = -^-4 + 7^=72 T(w) + dT(w) + ... (15.46) 

\z — wy \z — w) z z — w 
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In the special case of a free scalar field, we have just shown that c = 1; a 
similar calculation shows that c = 1. It is one of the numbers which charac- 
terizes any given conformal field theory. Later, we will consider conformal 
field theories with other values of c, c. For now, let us continue to proceed 
with full generality and take c, c to be arbitrary. 

Let us consider some of the consequences of the existence of c, which 
is usually called the central charge, for reasons which will become clear 
momentarily. By taking the contour integral of (15.45) with e(z), we can 
find the transformation property of T{z) under the conformal transformation 
z — > z + e(z): 



SeT(w) = <z) T{z) T(w) 



dz , N / c/2 2 rTi . . 1 nrri . . 
T~-< z ) 7 ,4 + 7 y2 r M + + - 

Z7TZ \ (Z — Wp (Z — Wy Z — W 

= j^d 3 e(w) + 2de(w) T{w) + e(w) dT(w) (15.47) 

For e(z) = l,z,z 2 , i.e. for the algebra of global conformal transformations, 
T{z) transforms as a primary field. For a finite transformation, z — > /(z), 

The odd-looking second term is called the Schwartzian derivative. It is 
not so obvious that this is the finite transformation corresponding to the 
infinitesimal transformation (15.47), although the converse is clear. Some 
insight may be gained by checking that it vanishes for a global conformal 
transformation z — > (az + b)/ (cz + d). In fact, it is the unique such quantity, 
up to a constant coefficient. We can become a bit more comfortable with 
the Schwartzian derivative by considering the example of a free scalar field. 

T(z) = -\ :d<f(z)df(z): 

= -k m oH 2 + f)M z -!) + ;?) < 1M9 > 

The transformation properties of T(z) under a conformal transformation are 
determined by those of d(p, which is a primary field: 



!)) + ;?, 
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T(f(z))- 



= (f'(z)) 2 T(f(z))-\im 



(/(-+f)-/(--§)) 2 . 
/'(* + §)/'(*-§) l\ 



+ 



a ^ V (/(* + §) -/(*-§))' 



= (f\z)) 2 T(f(z)) - 



1 f'f"'-U f") 2 

12 



(/ 



A2 



(15.50) 



which is (15.48) with c = 1. Note that the real culprit here is the normal- 
ordering which must be done in order to define T(z) in a quantum theory. 
Thus, the ordering ambiguities associated with the quantization of a classical 
theory are responsible for making T{z) a non-primary field; classically, it is 
primary. 

Let us rewrite (15.45) in terms of the commutator between the modes of 
T(z), 

T{z) = Y,L n z- n - 2 (15.51) 



This expansion may be inverted to give the modes, L n , 



5> = f 2™ 



dz 



v n+l 



T{z) 



(15.52) 



If we take the integral of (15.45) over z and w after multiplying by z n+1 and 
w n+1 , we find 



dz 

2vri 
dz 



v n+l 



,n+l 



dw 

dw 
2^1 



w m+1 T{z)T{w) 



■ w 



m+l 



c/2 



+ 



T(w) + 



1 



(z — w) 4 (z — w) 
[L n , L m ] = — (n 3 - n) 5 m+nfi + 2(n + 1) L r 

[L n , L m ] = (n-m) L m+n + ^ (n 3 - ra) <5 m+ni o 



dT(w] 



z — w 
(m + n + 2) L m+n 

(15.53) 



Thus, we see that the classical algebra of infinitesimal conformal transfor- 
mations has been modified. It is extended by an additional term, which is 
proportional to the identity operator - hence, a central extension. 6 



15.7 Interpretation of the Central Charge 

Before taking up the representation theory of the Virasoro algebra, let's 
think about the physical interpretation of the central charge, c. 



254 CHAPTER 15. RUDIMENTS OF CONFORMAL FIELD THEORY 



The first obersvation is that the central charge is additive. If we take 
two field theories and simply add together their actions without coupling 
them, then their energy-momentum tensors add, and the central charges add 
(since there won't be any cross terms in their OPEs). N free scalar fields 
have c = N. Thus, there is a sense in which the central charge measures the 
number of gapless modes which a system has. As we will see, it weighs such 
modes differently. Fermions have different central charges from bosons, etc. 
At the end of this section, we will see that the particular form of accounting 
which is done by the central charge is the 'right' one for the purposes of 
giving insight into RG flows. 

15.7.1 Finite-Size Scaling of the Free Energy 

Further insight into the central charge, and a way of calculating it for an 
arbitrary system, is given by the following interpretation in terms of the 
finite-size scaling of the free energy. In a large but finite system of linear 
scale L, the free energy scales as: 



The bulk free energy density and boundary free energy are clearly non- 
universal since they are dimensional quantities. However, Aq can be - and 
is - universal. This means that it depends on the geometrical shape of 
the finite system in way which is independent of the particular theory under 
consideration and is completely independent of the short-distance cutoff and 
other microscopic details. 

At first glance, it is somewhat surprising that the third term in this 
expansion is proportional to InL rather than a constant. However, it can 
be seen that such a term is present by considering the effect on the free 
energy of an infinitesimal scale transformation, — > (1 + e)x M . Under such 
a transformation, the action varies by 



This follows from the definition of the energy- momentum tensor. Under 
such a rescaling, the size of the system changes by L — > (1 + e)L. Hence, 
the free energy changes by 



(3F = A 2 L 2 + A\L + Aq In L + . . . 



(15.54) 




(15.55) 



F(L + dL) - F(L) = dL — 




(15.56) 
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However, since the free energy is the logarithm of the functional integral of 
the action, the expectation value of the change in the action is equal to the 
change in the free energy, to lowest order in e: 

e -/3F(L+dL) = j e ~S~dS 

= Je' s (l- dS + 0(e 2 )) 
= e-PHL) (! _ {dS) + 0(e 2 } ) 

= e -/3F(L)-(dS) 

Hence, 

L %Hp- = -^f d2x (T Z z) (15.58) 

Hence, if the integral over the entire finite-size system of the right-hand-side 
of this equation is a non-zero, finite constant, then the free energy has a InL 
term in its expansion. 

In an infinite system in flat space, T zz = in a critical theory. However, 
on a curved space or in a finite region of flat space, it need not vanish. 
The simplest example of this is a wedge of the plane of angle 7. We can 
compute T zz in such a wedge using the conformal mapping w = z 1 ^ from 
the upper-half-plane to the wedge and the transformation law for T(z). 

W^) = (p-^wedgeH - (Q 2 - l) (15.59) 

Since (r uhp (z)) = 0, 

T wedgeH = ^(l-(^) 2 ) (15.60) 

At the corner of the wedge, w = 0, T(w) is singular. As a result of this 
singularity of T(w), it is not true that dT = 0', so the conservation law will 
require that T zz be a delta-function situated at the corner. 

<^H) 2 ))— ° 



(15.57) 
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(s( 1 -(^) 1 )) <w (-J- r - (1561) 

This gives a finite contribution to dF/d(ha.L): 




(15.62) 



By measuring how large a InL contribution there is to the free energy, 
we get a measure of c, and, hence, of how many degrees of freedom there are 
in the system. If one has a lattice model, but doesn't know what low-energy 
conformal field theories are associated with its critical points, then one could 
compute the free energy numerically and see how it scales with system size. 
Once, c is obtained, one can hope to compute all of the correlation functions 
of the theory using the techniques which we will discuss in the following 
sections - which would be a much more involved numerical calculation. 
If the theory is soluble by the Bethe ansatz, the the free energy can be 
computed analytically. The resulting central charge might enable one to 
compute the correlation functions of the theory, something which the Bethe 
ansatz solution does not give. 



15.7.2 Zamolodchikov's c-theorem 

The interpretation of the central charge as a measure of the number of 
degrees of freedom in a theory has an interesting corollary, due to Zamolod- 
chikov. Consider an arbitrary two-dimensional field theory. Let = T Z z- 
Then, we define: 

F (zz) 

(T(z,z) r(0,0)> = -^i 

(T(z,z) 6(0,0)) = £M 
z*z 
H (zz) 

(Q(z,z) 0(0,0)) = -^ (15.63) 

The forms of the right-hand-sides are dictated by rotational invariance. By 
unitarity, F (zz) > 0, H (zz) > 0. 

Meanwhile, translational invariance implies that: 



dT + dO = 



(15.64) 
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Multiplying the left-hand-side of this conservation law by T(0, 0) and taking 
the correlation function, we find 



Z 3 Z 



= zz F' + zzG' - 3G 

= F + G-3G (15.65) 

where F = z~z F' , i.e. F = dF/dt, where t = \n(zz). Similarly, we multiply 
the conservation law by 0(0,0) and take the correlation function, 

= G-G + H-2H (15.66) 

If we define the quantity C(t) = 2F(t) — 4G(t) — 6H(t), then we can imme- 
diately make two observations about C(t): 

• C(t) = c when the theory is critical, i.e. O = 0. 

• C(t) is a monotonically decreasing function of t since: 

j t C{t) = - 12 H(t) < (15.67) 

Hence, renormalization group flows always go from fixed points of large 
central charge to those of smaller central charge, and C(t) decreases along 
the flows. One consequence is that RG flows are necessarily gradient flows 
in unitary 2D theories; there can't be limit cycles or other exotic flows. 

This is sometimes justified by saying that the renormalization group pro- 
cedure involves course graining a system, so 'information' should be lost in 
this procedure, which would imply that RG flows should flow 'downhill' ac- 
cording to some measure of 'information' or entropy. However, this would 
imply that some version of the c- theorem should hold in higher dimen- 
sions. No such theorem has been found. A little further thought suggest 
that another hole in this intuitive interpretation of the c-theorem. An RG 
transformation takes a system with cutoff A and transforms it into another 
system with the same cutoff A. (The two systems are, of course, related 
by a rescaling.) Thus, there are precisely the same number of degrees of 
freedom before and after an RG transformation. Thus, the interpretation of 
the c-theorem as a statement about the information loss (or entropy gain) 
associated with course-graining is probably a little too naive. The following 
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is a more accurate interpretation. The function C(t) is the number of low- 
energy degrees of freedom, as measured by their contribution to the T — T 
correlation function or, loosely speaking, by their thermal conductivity or 
specific heat. The c-theorem may be nothing more than the statement that 
systems like to form gaps by allowing the gapless degrees of freedom to in- 
teract and form gaps. Thereby, their number is reduced and C(t) decreases. 
In more than two dimensions, it is possible to generate new gapless degrees 
of freedom by breaking a continuous symmetry, which results in Goldstone 
modes. Thus, it is not so clear in D > 2 that a low-energy theory will 
have fewer gapless modes than its high-energy parent. One could imagine 
having TV species of fermions with SU(N) symmetry which form bilinear 
order parameters which condense, thereby completely breaking the SU(N) 
symmetry. In such a case, there would be N 2 — 1 Goldstone modes, so the 
number of gapless modes - by any measure - would have to increase at 
low-energies for N sufficiently large. 



15.8 Representation Theory of the Virasoro Alge- 
bra 

Lq + Lq is the Hamiltonian. Since the right- and left-handed parts of the 
energy-momentum tensor decouple, we can separately consider Lq which is 
E + P, which is plays the role of the Hamiltonian for the right-handed part 
of the theory. The L n s are raising and lowering operators, as may be seen 
from the commutation relation: 

[L ,L n ] = -nL n (15.68) 

For n > 0, L n is a lowering operator. For n < 0, L n is a raising opera- 
tor. Hermiticity implies that Ln = L_„ since T(z) = ^ L n z~ n ~ 2 in radial 
quantization implies that T(x, t) = ^2 L n e~ m ( x ~^ in the original Minkowski 
space. We will use these raising and lowering operators to build up repre- 
sentations of the Virasoro algebra. 

Let us consider some representation. Since the energy is bounded below, 
there must be some state \h) in the representation such that 

L n \h) = h\h) (15.69) 

L n \h) = Vn>0 (15.70) 

Such a state is called a highest weight state or a primary state. The latter 
term is suggestive of a primary field, to which a primary state correspond. 
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In conformal field theory, there is a simple correspondence between states 
and operators. We act on the vacuum with with the operator 4>(z) at the 
origin z = 

10) = <K0)|0) (15.71) 

In radial quantization, z = is actually t = — oo, so this state should be 
thought of as a state with a single quanta in the distant past. Suppose 
that &h(z) is a primary field of weight h. We define 

|fc> = 3(0)|0> (15.72) 

Then, according to (15.36) 

T(z) $ ft (0) = A $a(0) + i <9<M0) + non-singular terms (15.73) 
Acting on the vacuum state and expanding T(z), we have 

T(z) # fc (0)|0) = A $ h (0)|0) + i 0$ h (O)|O) + non-sing. 
T(z)|/i) = A + I 0$ fc (O)|O) + non-sing. 

J> n z-"- 2 |/i) + 

n>0 

A|fc) + ^L,^-™- 2 ^) = A + I 9$ h (0)|0) + non-sing. (15.74) 

n<0 

Comparing the right- and left-hand sides of the last line, we see that 

LA(0)|0) = L n \h) = V n > (15.75) 

and 

L^ h (z) = d$ h (z) (15.76) 

Thus, as advertised, primary states are created by primary fields. 

Implicit in this derivation, was the notion of a vacuum state 1 0) . As usual, 
we expect the vacuum state to be invariant under the global symmetries of 
the theory. In this case, this means Lo|0) = L±i|0) = 0. Another perspective 
on this condition on the vacuum state may be gained by considering the 
trivial OPE: 

z 2 

T(z) 1 = T(0) + zdT(0) + yd 2 T(0) + . . . (15.77) 

where the identity operator is taken to be nominally at the origin. From 
this OPE, we see that L n 1 10) = L n \0) = for all n > since the identity 
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is a primary field. We further see that Lo|0) = £-i|0) = 0, as required by 
global conformal invariance. Finally, we see that L_2|0) = T(0)|0) and, more 
generally, L_„|0) = d n ~ 2 T(0)\0). For some purposes, it is useful to think of 
the primary states as 'vacuum' states of sectors - each sector corresponding 
to a different primary field - of the Hilbert space. The actual vacuum is then 
simply one of these states, the one corresponding to the identity operator 
|0) = 1|0). 

Any given conformal field theory will be characterized by a central charge 

c. It will contain some number of primary fields, (fr^, ®h 2 i If the theory 

has a fnite number of primary fields, it is called a rational conformal field 
theory. For each primary field, $h„, we have a sector of Hilbert space which 
is built on the highest weight state \h n ) by acting with raising operators. 
These states form a Verma module: 



Lq eigenvalues States 

h \h) 

h + l L-!\h) 

h + 2 Ll^h), L_ 2 \h) 

h + 3 Ll^h), L_iL_ 2 |/i), L_ 3 |/i) 

h + A Li^h), L^L^h), L_iL_ 3 |/i), L 2 _ 2 \h), L_ 4 |/i) 



The states which are obtained from the primary states by acting with 
the L n s are called descendent states. The fields to which they correspond 
are descendent fields. Descendent states must be orthogonal to primary 
states. A primary state \x) is defined by the condition L m \x) = for all 
m > 0. A descendent state, is given by = L- mi . . . L_ m J/j) for some 
-mi, . . . , -m n < 0. Then, the inner product (tp\x) = ((Y>|L_ mi . . . L- m J \x) = 
{ip\ (L mi . . . L mn \x)) = 0. A primary descendent state, i.e. a state which is 
simultaneously a primary state and a descendent state, has vanishing inner 
product with itself and all other states, so it should be set to zero in any 
unitary representation of the Virasoro algebra. 

In (15.74), we saw that L^\<&h = d&h By keeping the less singular terms 
of (15.74), we can find the other descendents of 

*rW - i-AH = j ^ {z _ l w) n-i T{z)*h{v>) (15-78) 

The descendent state can thus be obtained by acting on the vacuum with 
L- n \h) = <]?^ n (0). From our earlier discussion of the identity operator, we 
see that the energy-momentum tensor is a descendent of the identity. This 
is one way of understanding why it is not a primary field. 
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Each Verma module is a single irreducible representation of the Vira- 
soro algebra. There are infinitely many states in each representation. This 
should not be surprising since the Virasoro algebra is infinite-dimensional. 
The Hilbert space of any conformal field theory will contain some number 
(a finite number, if the theory is a rational conformal field theory) of such 
representations. It is useful to keep in mind the analogy with SU(2). A 
theory which is invariant under the SU(2) symmetry of spin roations will 
have a Hilbert space which can be broken into irreducible representations of 
SU(2): spin-0 representations, spin-1/2 representations, spins-1 representa- 
tions, etc. A spin-s representation will be 2s + 1-dimensional and there will, 
in general, be many of them for each s. Thus, SU(2) symmetry introduces 
some simplification, but only a finite amount of simplification, which is not 
enough to determine the spectrum of a theory with an infinite-dimensional 
Hilbert space. 

The Virasoro algebra is infinite-dimensional, so it imposes far more struc- 
ture on Hilbert space. If there are infinitely many primary fields, then this 
may still not be enough to render the theory tractable. However, for the 
case of rational conformal field theories, the Virasoro algebra reduces the 
problem of determining the spectrum of the theory to that of determining 
a finite set of numbers hi, hi, • • ., h n . Note that the Virasoro algebra - 
and, therefore, the algebra of infinitesimal conformal transformations - is 
not a symmetry algebra of the theory in the usual sense (i.e. in the sense 
in which we used it above for SU(2)). The operators in this algebra do 
not commute with the Hamiltonian, Lq. Rather, the Virasoro algebra is a 
spectrum-generating algebra of the theory, analogous to a, for the simple 
harmonic oscillator. 

Our discussion of Verma modules appeared not to depend at all on c or 
h. The latter was merely an offset from zero by which the entire spectrum 
was rigidly shifted. This is misleading. We have not imposed any of the 
commutation relations other than the [Lo,-L n ] commutation relations. In 
fact, it turns out that for some c, h, these commutation relations require 
that some of the states in a Verma module with a given Lq eigenvalue are 
not linearly independent. Hence, some Verma modules are smaller than one 
would naively expect from the above construction. 

The simplest example of this is the Verma module built on the identity 
operator - or, in other words, the Verma module built on the vacuum state 
|0). The vacuum state of a conformal field theory should be invariant under 
SL(2,C) global conformal transformations so, in particular, L_i|0) = 0. As 
a consequence, there is no state with Lq eigenvalue 1 in the vacuum sector of 
the theory. Furthermore, there is only one state with Lq eigenvalue 2 since 
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£?_i|0) = 0. Indeed, at every level, there are fewer states than one would 
naively expect. 

The condition L-i|0) = follows from the SL(2,C) invariance of the 
vacuum. Another way of deriving it, which can be generalized to other 
primary fields is by condiering the inner product 

|L_i|0)| 2 = <0|LiL_i|0) (15.79) 

Using the commutation relation [L±, L-i] = 2Lq, and the condition Li|0) = 
0, we see that 

|L_i|0)| 2 = (0|2L |0) = 2h = (15.80) 

since the vacuum state has h = 0. A state with vanishing norm is called a 
null state. In order to have a unitary representation of the Virasoro algebra, 
we remove such states from the Hilbert space which we are constructing. In 
a similar way, we will use the constraints imposed by the Virasoro algebra 
for a given c on a representation with a given h to find other null states. 

Before we do so, let's pause for a moment to see why this is so important. 
Knowing the spectrum of a quantum field theory is not a full solution of the 
theory. In order to compute correlation functions, we also need the matrix 
elements of the operators of interest. In a conformal field theory, we can 
focus on primary operators because the correlation functions of descendent 
operators can be obtained from them. In order to compute correlation func- 
tions of primary operators, we need to know how to decompose the tensor 
products of irreducible representations of the Virasoro algebra into a sum of 
irreducible representations. It is useful to consider the analogy with SU(2). 
Suppose that we have the correlation function 

(0|C 1 C 2 ---ejO) (15.81) 

where the (p l a , a =|, j are spin-1/2 fields. This correlation function will only 
be non-vanishing if these fields are taken in some spin-singlet combination. 
For example, for n = 2, the result is proportional to e aiCt2 ; for n = 3, there is 
no way of making an invariant combination, so the correlation function must 
vanish; for n = 4, there will be a contribution proportional to e aiOC2 e a3a4 and 
a contribution proportional to (& v &) aia2 • ( aV &) a3a4 'i an d so on. In short, 
we can determine the spin structure of this correlation function if we know 
how to construct SU (2) invariants out of tensor products of spin-1/2 fields. 
More generally, we can consider correlation functions of fields of arbitrary 
spins; again, we simply need to know how to construct invariants. In the 
case of SU(2), there is a simple decomposition 

h®h®l%- j2 \h (15.82) 
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By applying this relation n — 1 times to a correlation function of n fields 
and, finally, keeping only the spin j = piece after the last step, we obtain 
the spin structure of the correlation function. 

In order to calculate correlation functions in a conformal field theory, we 
need to know how to multiply representations together in this way in order 
to get invariants. In the case of two-point functions (^i^), we know that 
we need hi = hi- In the case of three-point functions, {$i$2®3), we need 
to know whether the OPE of <I>i and <E>2 contains $3. This is is essentially 
the question of whether the product of the representations (c, hi) and (c, h-i) 
contains (c, /13). If we know this, then we can compute the correlation func- 
tion because the spatial dependence is essentially determined by conformal 
invariance. Consider the four-point function (^i^*^^)- Using the OPE 
of $1 with <3?2 and the OPE of <I>3 with $4, we can write this as 

<$l(zi) $ 2 (Z 2 ) $3(^3) $4(^4) ) = ( J> 2fc (*1 " Z2) hk - hl - h2 $k{z2) X 

k 

n 

= E 3^ro^ <•.<*>•*<«.» 

k,n Z 12 z 34 

(15.83) 

In principle, the sums on the right-hand-side run over all fields in the theory, 
and are, hence, unmanageable. However, we can group each primary field 
with its descendents, and thus reduce the sum, formally, to a sum over pri- 
mary fields. This is still unmanageable if there are infinitely many primary 
fields. Theories with a finite number of primary fields are called rational 
conformal field theories. In these theories, the sum on the right-hand-side 
can be reduced to a finite sum and some progress can be made. 

Even when there is a finite number of primary fields, how can we deter- 
mine the QjfcS? If we knew more about the theory, for instance if we knew its 
action and had some sense of how the different primary fields were related 
physically, then we might be able to deduce which primary fields appear 
in the operator product of two others. If we had additional symmetries in 
the theory, then we might be able to derive further restrictions. However, 
suppose that we wish to proceed purely algebraically, knowing only c and h 
and no further information about the theory. In such a case, we must take 
advantage of the existence of null states. 

If we take the tensor product of two representations of the Virasoro 
algebra (c,hi) and (c, fo) and one or both of them have null states, then 
this limits the representations (c,hk) which can appear in their product. 
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(c, hk) must also contain null states, and at certain specific levels which are 
predicated by those of (c, h\) and (c, /12). Thus, the existence of null states 
is a boon, not a nuisance. 

15.9 Null States 

In order to solve a conformal field theory with a given central charge, c, 
we need to determine the primary fields of the theory, i.e. the spectrum 
of /is, and also how these different irreducible representations are tensored 
together. In general, there is no way to solve either problem purely alge- 
braically. However, for certain values of c, the Virasoro algebra will not allow 
unitary representations for most values of h; it will require the existence of 
negative norm states. Hence, at these values of c, we can figure out the 
allowed primary fields in the theory since they correspond to those values 
of h ate which a unitary representation is possible. Note that, in principle, 
two different conformal field theories with the same c could include different 
subsets of the allowed hs. In order to solve the second problem - how to 
decompose the tensor product of representations - we can, as we discussed 
in the previous section, make progress in those fortunate situations in which 
the allowed representations have null states. By reducing the size of the 
Verma module, these constrain the OPE. It may turn out that the OPE 
requires that all of the allowed hs must actually be in the theory in order 
for the OPE algebra to close; alternatively, there may be a consistent OPE 
involving some subset. 

From a logical standpoint, the uses of negative norm states and null 
states are different: the former determines the allowed hs; the latter, the de- 
composition of the products of these hs. However, they are usually discussed 
together because the existence of both can be derived with one fell swoop. 
Here, we will eschew this approach, and discuss them somewhat separately 
and in the reverse of the normal order. 

Let us see how null states can be used in a particular example, which 
will turn out to be the Ising model. We consider a theory with c = 1/2. If 
h = 0, which corresponds to the identity operator, then there is a null state 
at level one since L_i|0) = 0, as we saw in the previous section. Now, let's 
consider some representation in this theory with h 7^ 0. Consider the two 
states at level 2, L^^h) = and L^\h) = 0. Suppose that they are not 
linearly independent, so that there's a null state at this level. Then, 



L- 2 \h) + aL\\h) =0 



(15.84) 
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for some a. Acting on this with L\, and using the commutation relations of 
the Virasoro algebra together with the fact that Li\h) = 0, we find 

= Li (L_ 2 |/i) + aL 2 _ 1 \h)) 
= 0+([L 1 ,L_ 2 ]+a[L 1 ,L 2 _ 1 ])\h) 
= (3L_i + aL_i (2L ) + a (2L ) L_i) 
= (3 + 2o(2/i + 1)) 

Hence, the state (15.84) is a null state only if 

3 

a ~" 2(2/i + l) 
If this state is null, then it must also be true that 

= L 2 (L_ 2 |/i> +aL 2 _ 1 \h)) 
= 0+([L 2 ,L_ 2 ]+a[L 2 ,L 2 _ l ])\h) 

= (4L + °- + aL_! (3Li) + a (3Li) L_iJ 
= (4/1 + | + 6a/i) (15.87) 

Substituting the value of a obtained above, this implies that 

16/i 2 - (10 - 2c)h + c = (15.88) 

For c = 1/2, this has the solutions h = 1/16, 1/2. 

As we will see, these correspond to the spin field, a and the energy 
operator, e in the Ising model. According to this identification, we can 
obtain the critical exponents 77 and v from the two-point functions of the 
dimension 1/16 and 1/2 operators. In the Ising model, the physical fields are 
left-right symmetric cominations, so these dimensions are effectively doubled 
by the anti-holomorphic dependence. 

M^MO^)--^-! (15.89) 

from which we see that 77 = 1/4. Similarly, 

from which we see that v = 1. These are, indeed, the well-known critical 
exponents of the 2D Ising model. 



(15.85) 



(15.86) 
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Suppose that we want to compute a more non-trivial correlation function 
in the critical Ising model. We can use the existence of null states is the 
h = 1/16 and h = 1/2 representations. Consider, for example, the four-point 
function of the spin field a. Since 



2(2-^ + 1) 



L 2 ! |1/16> = 



(15.91) 



it is equivalently true that 



L 2 1 )a = 



(15.92) 



when cr{z) is inside of some correlation function. As we will see in a moment, 
this will give us a differential equation satisfied by the four-point function. 

In order to derive this differential equation, we begin by noting that the 
following correlation function, 



(Wi) . . . (p n -i (w n -l) L- k (j) n (z)) 



(15.93) 



would vanish if we move the to the left, where it could act on the 
vacuum state and annihilate it. However, in the process of moving it to the 
left, it must commute with the fa (u>j)s. Using equation (15.78), this means 
that 



(j>i (u/i) . . . <f> n -i K-i) j> ^ (z , T(z')Mz)^ 



(15.94) 



can be simplified by taking breaking the contour into small circles encircling 
each of the iWjS. Thus, we obtain 



(<Pl {Wl) ■ ■ ■ <t>n-l (Wn-l) L_ k (j) n (z)) 

"~ J r (l - k)hj 1 d 



-E 



+ 



(wj — z) k 1 dw j 



(wi) ... n _l (w n -i) n ((l2^.)95) 



Applying this, in conjunction with (15.92) to the a four-point function, 
we have 



n-l 



a dzf ^ 

1 3=1 



1/16 



+ 



1 



d 



Z4 — Zj) 2 Z4 — Zj dZj j 



(a (zi) a (z 2 ) (J (z 3 ) a (z 4 )) 

(*5ffl6) 
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In the first term on the left, we have used L_i0 = d<p which is simpler than 
(15.95), but equivalent to it by translational invariance. 
Using global conformal invariance, we can write 



1/8 

{a (zi) a (z 2 ) a (z 3 ) a (z 4 )) 



( Z13Z24 ) F(x) (15.97) 



where x = z\ 2 z 34 j z\ 3 z 24 . We have suppressed the dependence on z for 
simplicity. 

Substituting this form into the differential equation (15.96), we have the 
ordinary differential equation 

dx 2 Jo 
This equation has two independent solutions 



a < 1 - a! »S? + U- a! Jte + 5)- (15 ' 98) 



/l, 2 (x) = (l±vT^) 1/2 (15.99) 

This is clearly a multiple-valued function. In order to get a single- valued 
result, we must combine the holomorphic and anti-holomorphic parts of the 
theory. The only way of doing this is by taking the left-right symmetric 
combination f\(x)f\(x) + f 2 (x)f 2 (x): 



(a (zi,zi) a (z 2 , z 2 ) a (z 3 , z 3 ) a (z 4 , z 4 )) = 

1/4 ^ 

(|i + VT^\ + |i - v / T^(i^ioo) 



^13^24 



^12 ^23 234 ^41 

for some a. 

Now, we can determine a±, a 2 as well as determine the OPE of a with it- 
self by considering the behavior of this correlation function in various limits. 
First, let's conside the OPE of a with itself: 

a(zi,zi) a(z 2 ,z 2 ) = — 3_ + C aa£ \z 12 \ 3/4 " e (z 2 ,z 2 ) + ...(15.101) 
R2I 

At this stage, we do not yet know whether there are other primary fields 
in the c = 1/2 theory, so the . . . could, in principle include both primary 
and descendent fields. In fact, as we will see later, there are none, so the 
. . . contains only descendent fields. If we take z\ 2 — ► and Z34 — > in the 
four-point function and use these OPEs, we have 



(a (z!,zi) a (z 2 , z 2 ) a (z 3 , z 3 ) a (z 4 , z 4 )) = 
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I 1 1/4 I 1 1/4 
F12 F34 



+ C<T CT e 1^12 



3/8 M 3/8 <e (z 2 ,z 2 ) e(z 4 ,z 4 )) + 



I I 1/4 I 
F12 F34 



i |„ 1 3/4 I 1 3/4 

^TTI + ^ M -4(15.102) 



P24 



Meanwhile, taking the same limit in (15.100), we have 



(a(zi,zi) a(z 2l z 2 ) a(z 3 ,z 3 ) (7(2:4,2:4)) 

1/4 



^12^34 



1 Z12Z34 



6 24 



+ 



1 ^12^34 

2 7 2 ^ 



3.103) 



Hence, comparing the leading terms, we see that a=\. Comparing the next 
terms, we also see that C aa£ = \. We also note that there are no operators 
with (h,h) < appearing in this OPE (which would be natural if there 

were no other primary fields in the theory). 



15.10 Unitary Representations 

In fact, the h = 0, 1/16, 1/2 representations are the only ones in the c = 1/2 
theory. If we tried to construct a representation with any other value of h, 
it would be non-unitary, so it should not arise in most physical theories. (As 
we will discuss in the next part of the book, systems with quenched random 
disorder are described by non-unitary field theories, so the requirement of 
unitarity does not help us there.) Let us see why this is so. 

At level 2, the existence of a null state can be determined by taking the 
determinant of the matrix 

/ (h\L 2 L- 2 \h) (h\L\L_ 2 \h) \ _ ( Ah + c/2 6/1 \ 
V {h\L 2 LWh) {h\LlL\\h) ) ~ V 6h 4/i(l + 2/1) J 

= (I6h 3 - Wh 2 + 2h 2 c + he) 
= 32 (h - /11.1(c)) x 

{h- h li2 \c)) (/i- /i 2 ,i(c)Xl5.104) 

where 

_ [(m + l)p - mq} 2 - 1 
p,q Am(m + 1) 

m= -Win^ (15 - 105> 

h\.i = corresponds to the null state at level 1, which must propagate to 
higher levels. At c = 1/2, h± >2 = 1/16 and /i 2 ,i = 1/2 are the null states 
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at level 2. If, in the c = 1/2 theory, we had considered a representation at 
h = 1/4, for instance, then we would have found this determinant would 
be negative. This would imply that there are negatie norm states in the 
representations, so that it could not be unitary. In order to find the full 
set of restrictions on the allowed /is, we must also consider the analogous 
determinant at levels 3,4, . . .. 

The determinant of inner products at the -/V th level is given by the fol- 
lowing formula, which generalizes (15.104) 

detM N (c,h) =a N ]j (h - h M {c)) P{N ~ pq) (15.106) 

pq<N 

where a;v is a constant independent of c,h. We will not prove this for- 
mula here, but the basic idea is to write down for each p, q an explicit null 
state at level pq. Each of these null states leads to P(N — pq) null states 
L_ ni . . . L- nk \h + pq) at level N (where P(n) is the number of partitions 
of n). A polynomial is determined, up to an overall constant, by its ze- 
roes, which leads to the result above. One can check that the null states 
constructed are the full set by comparing the highest power of h on both 
sides. 

Let us now consider the question of whether the eigenvalues of detMjy (c, h) 
are positive. Keep in mind that the factorization (15.106) gives the product 
of eigenvalues, but the factors in (15.106) are not the eigenvalues themselves. 
When the determinant is negative, there are an odd number of eigenvalues. 
For 1 < c < 25, h > 0, m is not real so the h Ptq s either have an imaginary 
part or are negative (the latter only occurs for p = q). Hence, the determi- 
nant never vanishes. For large h, the eigenvalues are strictly positive since 
they are just the diagonal elements (which are positive) in this limit. Since 
the determinant never changes sign, the eigenvalues must remain positive. 

For c = 1, m = oo, so h VA = (p — q) 2 /4. The determinant vanishes at 
these values, but is nowhere negative. There are null states, which must be 
set to zero, at these hs but there are no negative norm states, so there are 
unitary representations at all hs. 

For c < 1, h > not lying on the curves h P:q (c), there exists some level 
N = pq such that the point (c, h) can be connected to the region c > 1 by 
crossing precisly one of the curves h P: q{c). Since an eigenvalue changes sign 
at these curves and the region c > 1 has only positive eigenvalues, this means 
that there is a single negative norm state at this level for this (c,h). The 
only exceptions are the the curves h Ptq (c) themselves. It can be shown that 
there is a single negative norm state along these curves except at the "first 
intersection" points where two of these curves intersect. At these values of 
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c, h, there are null states but no negative norm states. These crossings occur 
at a discrete series of c: 

c=l . 6 - (15.107) 

m(m+l) 

At each such c, there is a finite set of allowed /is: 

h pq (m)= [(m+ A 1) r m f ~ l (15-108) 
p ' gy ' 4m(m + l) 

These theories are called 'minimal models'. The case m = 3 is the Ising 
model, as we mentioned earlier, m = 4 is the tricritical Ising model (in 4 — e 
dimensions, this is a scalar field with potential V(<p) = + u<f) A + t></> 6 ; the 
tricritical point is at r = u = 0), while m = 5 is the 3-state Potts model. 

As we claimed earlier, there are only three possible primary fields in the 
c = 1/2 theory, with h = 0, 1/2, 1/16. Any other h would be a non-unitary 
representation of the Virasoro algebra for c = 1/2. 

Using techniques analogous to those which we used in our discussion 
of the c = 1/2 theory, we can compute the OPEs of fields in the minimal 
models. Focussing on the chiral part of these theories, we can examine 
the three-point correlation functions to determine which primary fields can 
appear in the OPE of two primary fields. Such a relation is called a fusion 
rule: 

0Pi,<zi x <^W = X^P3,93 (15.109) 

The rule which specifies which q% can appear on the right-hand-side of this 
equation can be expressed most neatly by writing pi = 2ji + 1, = 2j' i + 1. 
Then the allowed j$s are \j\ — ji\ < J3 < min {j\ + j'2, m — 2 — j\ — j'2), and 
the analogous rule holds for j' 3 . This is almost the same as the decomposition 
of the product of the spin j\ and ji representations in SU(2); the only differ- 
ence is that the upper limit 31+J2 is replaced by min (ji + j'2, m — 2 — ji — j'2). 
The origin of this rule will be clearer when we discuss Kac-Moody algebras. 

15.11 Free Fermions 

In 1 + 1 dimensions, the action of a Dirac fermion = (^) can be written 
in the form: 

s = j Y^tV^ 

= J tp^dtpn + ip^dipL (15.110) 
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where 7 = a x , 7 1 = a y . Thus, the fermion splits into two independent 
fields, one right-moving and the other left-moving. The right-moving field 
satisfies the equation of motion dipR = while the left-moving field satisfies 

d^ L = 0. 

A right-moving Dirac fermion can be written as the sum of two right- 
moving Major ana fermions which are its real and imaginary parts ip = tp\ + 
iip2- In radial quantization, a right-moving real fermion is one whose mode 
expansion 

ii;(z) = J24>nZ- n - 1/2 (15.111) 

satisfies tpt = ip- n . This is what one usually means by a real field, but it 
is somewhat masked by the fact that we're in imaginary time, z = e T ~ tx , 
and the extra 1/2 in the exponent which results from the passage from the 
cylinder to the plane in radial quantization. The inverse Fourier transform 
is: 

V>n = f^-.i^(z)z n - 1 / 2 (15.112) 
The action for a right-moving Majorana fermion ip(z) is 

S = J (15.113) 

(The fact that we can simultaneously diagonalize chirality and charge con- 
jugation is special to 4k + 2 dimensions. In general, we can either have a 
right-moving fermion (a Weyl fermion) or a Majorana fermion.) From the 
action (or, essentially, by scaling), we can compute the OPE of a Majorana 
fermion: 

Mz)4>(w) = — + ... (15.114) 

z — w 

The energy-momentum tensor is 

T = _I : ^dil> : (15.115) 
The central charge is obtained from 

/ x / x 1/ x ( 1 A 2 1 1 - 1 

T(z)T(w) = - -1 ^ + - ^ ^ + 

4 \ z — w J [z — wy 4 [z — w) z [z — wy 

■T(w) 



(z — w) 2 
1/4 . 
(z — w) A (z — w 



1/4 (15.116) 
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The (—1) in the first term on the first line results from an anticommutation. 
From this calculation, we see that c = 1/2. Hence, a theory of a free 
Majorana fermion is the same as the critical theory of the Ising model. This 
should not come as an enormous surprise if one recalls our rewriting of the 
partition function of the 2D Ising model as a Grassman integral. Thus, a 
Dirac fermion, composed of two Majorana fermions, has c = 1. 

From the OPE (15.114), we can compute the anticommutator of the 
modes of ip: 



2iri J 2iri z — w 

= $n+m,0 (15.117) 

Fermionic fields can be either periodic or antiperiodic as one goes around 
the cylinder, z — > e 2m z. Physically measurable quantities are always bosonic, 
so they must be composed of fermion bilinears, which are single-valued. For 
periodic boundary conditions, we need n G Z + i in the mode-expansion 
(15.111), while n £ Z gives antiperiodic boundary conditions. Note that for 
anti-periodic boundary conditions, n £ Z, there is a zero mode, ipQ. Accord- 
ing to the canonical anticommutation relations, {V'Oj^o} = 1> or t/>q = 1/2. 
A little bit later, we will discuss the zero mode operator a little further. 

Let us compute the fermion propagator in these two cases. For periodic 
boundary conditions, 

«.< z ) aw)) = f <V </>„ z-"- 1/2 Y. *•» »" m_1/2 



E 

n=l/2 
oo 



z -n-l/2 w n-l/2 



n=0 



Z — W 

For antiperiodic boundary conditions, 



(15.118) 



oo ^ 



n=l 
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Jzw 

',2 _ 1 



(l^ + «>) < 15 - 119 > 



using ?/)g = 2, we have: 



{Hz) Hw)) = 2W^Vz) (15 12Q) 

As expected, the short-distance behavior of the propagator is the same for 
both boundary conditions, but the global analytic structure is different. 

We can introduce a 'twist' operator, a, which, when placed at z\ and Z2 
introduces a branch cut for the fermions which extends from z\ to Z2- The 
OPE of a Majorana fermion tp(z) with its twist field a{z) is: 

#*M0) = -L M (0) (15.121) 

where fi(z) is an 'excited' twist field of the same dimension as a (we will 
discuss this more later). The on the right-hand-side ensures that a 
minus sign results when ip(z) is taken around <r(0). If we place twist fields 
at the origin and at oo, this will exchange periodic and antiperiodic boundary 
conditions. Thus, 

(0; P |ct(oo) il>{z) ip{w) cr(0)| 0; P) = (0; AP \^{z) ^{w)\ 0; AP) (15.122) 

Differentiating both sides with respect to w, we have 

(0; P |cr(oo) %l>(z) d w ip{w) <r(0)| 0; P) = d w (0; AP \if>(z) ip{w)\ 0; AP) (15.123) 

We know the right-hand-side. The left-hand-side can be re-expressed in 
terms of the energy- momentum tensor, T by taking z — > w and subtracting 
an infinite constant — l/(z — w) 2 . 

1 1 



(0;AP|V(z)cWH|0;AP) = --V-J2 + 



(z — w) 2 8 w 2 

Thus, 



(z - w) 2 4 w^z 1 / 2 

+ \\ + 0(z - w\l$.l24) 



^))AP = -«^2 ( 15 - 125 ) 



1 1 

16 7 2 

Hence, h a = 1/16. Thus, our use of the notation a is more than merely 
suggestive. The twist field, a, is the spin field of the Ising model. 
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To summarize, antiperiodic boundary conditions are simply the sector 
of the full c = 1/2 theory given by h = 1/16 Verma modules. In this 
sector, there are actually two states with Lq eigenvalue 1/16 because there 
is a zero mode i/jq. This zero mode can be either occupied or unoccupied. 
For a Majorana fermion, tp is a linear combination of a creation and an 
annihilation operator (with coefficient 1/^/2), so acting twice with Vo wu l 
create and then annihilate (or the reverse if the zero mode were initially 
occupied) a fermion in this mode, thereby leaving the state unchanged, up 
to a factor of 1/2. One of the two states is <r(0) |0) (zero mode unoccupied) 
while the other is fi(0) |0) (zero mode occupied), a is the spin field, or order 
operator, while ji is the disorder operator to which it is dual. 

Thus, the c = 1/2 theory of a free Majorana fermion has one copy of the 
h = representation, one copy of the h = 1/2 representation, and two copies 
of the h = 1/16 representation. These representations are obtained by acting 
with the L_„s, on, respectively, |0), ^_i/ 2 |0), cr(0)|0), ^ cr(0)|0) = //(0) |0). 
h = Representation h = 1/2 Representation 

Lo eigenvalues States L eigenvalues States 

|0> 1/2 ~ V-1/2IO) 

1 3/2 V-3/2 10> 

2 ^-3/2^-1/210) 5/2 V>- 5 / 2 |0> 

3 ^-5/2^-1/210) 7/2 ^- 7 / 2 |0) 



h = 1/16 Representation 
States 

luT) 

^-2lw> 

^-3lll) 



Lo eigenvalues 
j_ 

16 1 
! + H 

2 + B 

3 + B 



/i = 1/16 Representation 
States 

V'oIb) 
V'-iV'ol 

-0-2-001 1^) 
^-3^0^) 



Lq eigenvalues 
j_ 

16 1 
! + T6 

2 + 4 

3 + 



1 

16 



The Ising model itself has both right- and left-moving Majorana fermions. 
One might naively think that its Hilbert space could have representations of 
all possible right-left combinations, (0,0), (1/16,0), (1/16,1/2), etc. How- 
ever, this is not the case. In fact, there are only the right-left symmetric 
combinations: (0,0), (1/16,1/16), (1/2,1/2). This can be derived by di- 
rect computation on the Ising model, of course. It can also be obtained by 
putting the theory on a torus. The constraints associated with consistently 
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putting a theory on a torus ('modular invariance') restrict the allowed right- 
left combinations of the set of possible (h,h)s at a given (c, c). In the case 
of the Ising model, the only allowed combinations are the symmetric ones. 



15.12 Free Bosons 

Earlier, we considered a theory of a free boson, 4> =\ (<p(z) + Tp(z)), which 
has c = c = 1. This theory has (1,0) and (0,1) fields dip and dip. Let us 
now consider the exponential operators : e taip :. By Wick's theorem, 

. e ia<p(z) . . e ia<p(0) = e a 2 ((^M0)}-M0)) 2 ) 

1 

a 



(15.126) 

z L 



Hence, : e 10 "^ : is a dimension a 2 /2 operator. In fact, it is a primary operator, 
as we now show. These exponential operators have the following OPE with 

idip 

idip(z) : e^ (0) ■■ = - z - e iam : + . . . (15.127) 
Hence, the OPE with the energy-momentum tensor, T(z), is: 

-l :M ^ ): :e^: = -I(^) 2 + ... 

- a /2 + ... (15.128) 



(z — w) 2 

Thus, as claimed, the operator : e iaip : is a dimension a 2 /2 primary field. 

Now, suppose that <\> is an angular variable 4> = 4> + 2ttR. In such a 
case, dip and dip are still fine, but not all exponential operators are single- 
valued under (j) — > (ft + lixR. Consider operators of the form : e im ^^ R :=: 
e im(ip+ip)/2R They are clearly well-defined, and have dimensions (h,h) = 

(l (jk) 2 1 \ (iD?) 2 ) • Thus, these operators are among the primary fields of 
the theory. As we will see shortly, they are not the only primary fields in 
such a theory. 

In order to derive the full set of primary fields, it is useful to consider 
the mode expansion for a system of free bosons: 

idip(z) = J^anz-* 1 - 1 (15.129) 
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with 



a n = I — z n idip(z) (15.130) 
J 2m 



The commutators of the modes are: 

o j dw r 
n,„.o„ - /" t> 



2m J 2m (z — w) 2 
dw m _ _ i 

2iri 

= nS n+mfi (15.131) 

As a result of the n on the right-hand-side, the zero mode, ao, commutes 
with itself, as it had better. Consider the mode expansions of (p, Tp, and 4> 
itself: 



ip(z) = ifo + a In z + - a n z n 
^-^ n 

Tp(z) = ifQ + ao In z + ^2 - a n z~ n 



<f>(z,z) = 4> + a lnz + a lnz + (a n z- n + a n z n ) (15.132) 

where <fio = (ip^ + <Pq)/2. The zero modes ip^ L disappear from dip and 
&p, but not from the exponential operators : e 1171 ^^ :. The commutation 

Ft L • — 

relations of ip ' can be derived from the OPE of ip, Tp with dip,dTp: 



/dz 
— <p(z)d<p(0) 



dz i 
2tH2z 

= i (15.133) 

By similar steps, [i^Q,ao] = i- Hence, [<po, (ao + ao)] = i while [4>o, (ao — 
®o)] = 0. Alternatively, we can go back to the cylinder with coordinates x, r 
with z = e T ~ %x . Then, the mode expansion is: 



<f>(x, t) = 0o+ \ (a + a ) t+\ (a - a ) x+J^ - (a n e nr ~ inx + a n e nT+inx ) 

(15.134) 
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Meanwhile, according to the canonical commutation relations for (f>, 

[<p(x, T),do<f>(0, t)] = i S(x) (15.135) 

from which we obtain 

[4> ,{ao + a )]=i (15.136) 

With this commutation relation in hand, we can derive the spectrum of 
ao + ao, as we show below. 

Now, let us turn to the energy-momentum tensor and its eigenvalues. 
Since T(z) = — \ : dtpdtp :, 



L = ^a_ n a n + ]-al 



n>0 

L mf L = -^a m _„a n (15.137) 

n 

For m / 0, a m _ ra and a n commute, so we do not need to worry about 
normal ordering in the second line. There is a similar expression for the 
anti-holomorphic modes. From the commutation relations (15.131), we see 
that ao commutes with the Hamiltonian. Hence, we can label states by their 
ao, ao eigenvalues. Let's define a set of states |a,a) by 

ao|a,a) = a|a,a) 
a~o|a,a) = a|a,a) 
a n |a,a) = for n > 

a n |a,a) = for n > (15.138) 

We can build towers of states on these by acting with the a ra s for n < 
or, equivalently, with the L n s for n < and also with their anti-holomorphic 
counterparts. These are, in fact, Verma modules, and the states |a,a) are 
highest weight states. Their Lq, L eigenvalues are: 

Lo\a,a) = — ao|a,a) = — a 2 |a,a) 

Lo|a,a) = -a 2 |a,a) (15.139) 

These highest weight states are created by exponential operators. The 
simplest is the vacuum state itself, defined by 



«o|0) =0 
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a |0) = (15.140) 

Now, consider the state : e^ ) : |0), 

: e iam : |0) = e ia ^|0) (15.141) 

the right-hand-side follows from the fact that a n |0) = for n > 0, while the 
n < terms in the expansion vanish when we take z = 0. Thus, acting with 
ao, ao, 

a :e w(0) : |0) = a e Wo |0) 

e Wo a + e*^j |0) 



Similarly, 



1^ : e W(0). | ) (15.142) 



a : e w{0) : |0) = : e*(°» : |0> (15.143) 



To summarize, the different Verma modules of the theory correspond 
to different values of the zero modes, |a,a). Some of these highest weight 
states are created by the primary operators : e l/3< ^°) : acting on the vacuum: 

: e w(0) : |0) = \(3/2, (3/2) (15.144) 

Let us find the other primary fields in the case in which <j) is an angular 
variable, 4> = 4> + 2-kR. Then, 4>o = 4>q + 2nR. Since [4>o, (ao + «o)] = h the 
eigenvalues of (ao + ao) are quantized, (ao + ao) = ^. 

Periodic boundary conditions around the cylinder require, according to 
(15.134), that ao — ao = 2nR for some integer n, so that (ft — ► <j) + 2imR 
when x — > x + 2ir. Putting these two together, we have 

_ m 

a + a = — 

a -a = 2nR (15.145) 

or 

771 771 

a ° = 2~R + njR ' ® = 2R~ UR (15.146) 

Thus, the highest weight states of a free boson with angular identification 
4> = 4> + 2-irR are: 

™ + nj R, ™ - nR ) = : e <(ft+«H)v e *(ft-»«)? : | ) 
2ri 2-fl / 
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= . e im<t>/R e 2inB4 . ^ (15.147) 

where we have defined the dual field <f> = ^(ip — Tp). These states have 
{Kh) = (\(^+nR)\\{^-nRf 



The action of the exponential of the dual field, : e 2inRl ^ : is to increase 
(«0 — «o)/2 by nR. This causes cf) to wind n more times around 2ir as 
x winds around the cylinder or z winds around the origin in the complex 
plane. In other words, : e 2mR< K°) : creates an n-fold vortex at the origin of 
the complex plane. 

Let us consider the special case R = 1. At this radius, the primary states 
are | y + n, y — n) Let us divide these into the states in which m G 2Z + 1 
and those in which m G 2Z. The states with m G 2Z are of the form 

L \n R , n L ; P) = ^ n 2 R \n R , n L ; P) (15.148) 

with the obvious counterpart for Lq. Here, n R ^ = y ± n G Z . The states 
with m G 2Z + 1 are of the form 

L \n R , n L ; AP) = ^ (n R + \n R , n L ; AP) (15.149) 

with the obvious counterpart for Lq. 

These are the primary states of a Dirac fermion, which also has c = 1 
since it is composed of two Majorana fermions, ip R = ip R ,i + iip R ,2, each with 
c = 1/2. The same holds for the anti-holomorphic part of the theory, with 
c = 1. If we make the identifications 

e llp = 1pL = lpL,l + iipL,2 
e 2i * = Ml 

e t4> = OR,l<TR,2 °L,1°L,2 (15.150) 

a R,2, j cl,2 are the twist fields for ip R) L;i,2 or, simply the product of 
the twist fields for the Dirac fermions ip R , ipi- This product has dimension 

From these formulae, we see that 

= e -M^) e Mw) 



J R^ 

1 



w 
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1 



(1 — i(w — z)dip(w) + . . .) 



z — w 



1 



+ idip 



(15.151) 



z — 



w 



Hence, upon normal ordering, we get the conserved quantity 



n R =: ip^ipR := id(p 



(15.152) 



similarly, = idlp. 

One slightly peculiar thing about this theory is that we started with two 
copies of the c = 1/2 theory, which has only 3 primary states, corresponding 
to the (1) even and (2) odd fermion number in the untwisted sector and the 
(3) twisted sector (where even and odd fermion number are not distinguished 
because the zero mode moves one between them). When we tensor these 
two copies of the theory together, we don't get 9 primary states but, rather, 
infinitely many primary states. This is not really so peculiar when we recall 
that we are now talking about primary under the c = 1 Virasoro algebra of 
the combined theory, not under the individual c = 1/2 Virasoro algebras. 
The primary states correspond to different tir and til in the twisted and 
untwisted sectors. There is a big difference between a Dirac fermion and 
a Majorana fermion because the former has a conserved charge (or, rather, 
two conserved charges, one right-moving and one left-moving), while the 
latter does not. 

15.13 Kac-Moody Algebras 

We now consider conformal field theories which have other symmetries - such 
as charge conservation - in addition to symmetry under conformal transfor- 
mations. As we will see, when these other symmetries mesh with conformal 
invariance in a certain way, they lead to the existence of null states, which 
render the theory soluble. In such a case, the symmetry generators form a 
Kac-Moody algebra. 

Consider a theory with conserved currents J®, d^J^ = 0, where a = 
1,2, ... ,n label the currents. The associated conserved charges are Q a = 
f cLxJq. They satisfy some symmetry algebra [Q a ,Q b ] = if a bcQ c , where 
the fabcS are the structure constants of some Lie algebra G. (The commu- 
tator must equal a linear combination of charges because any state which 
is invariant under the symmetry transformations generated by the charges 
must be annihilated by the charges and also by their commutators.) In the 
case G = SU(2), f abc = e abc . 
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Now, suppose that the dual current & v J" is also conserved, d^e^J® = 0. 
Then, 

dJ = dJ = (15.153) 

where J = Jo + iJi, J = Jo — iJ\. In other words, J(z) is holomorphic while 
J (z) is antiholomorphic. 

There two conserved charges, associated with the integrals of the current 
or the dual current. In more physical terms, the right-moving and left- 
moving charges are separaely conserved. The charges Q R = § dzJ and 
Q R = <f dzJ must be dimensionless since, as conserved quantities, they 
commute with Lq and Lq. This, in turn, implies that J and J are dimension 
(1,0) and (0,1) fields, respectively. 

The most general form for the OPE for J a is 

U xab ■ fabc 

r(z) J b ( ) = 4p + +^— J c (0) + . . . (15.154) 

The second term on the right-hand-side is dictated by the commutators of 
the Q a s. The first term is allowed by the global symmetry (generated by the 
Q a s) and by scaling. The normalization of the central extension k is fixed 
by the normalization of the structure constants in the second term, except 
in the Abelian case, in which they vanish. 

This OPE can be translated into the commutation relations of the modes 

'JL4] = if abc J c m+ n + km5 ab S m+nfi (15.155) 

The m = modes still satisfy the commutation relations of the Lie algebra 
G. 

The simplest example of a theory with a Kac-Moody algebra is the free 
boson. The conserved currents are: 

J(z) = idip 

J{z) = idTp (15.156) 

These are the currents associated with the global symmetry <p — ► 4>+c, where 
c is a constant. As a result of the decoupling of the right- and left-handed 
parts of the theory, there is actually a much large set of symmetries of the 
classical equation of motion, ip — > ip + f(z), Tp -^Tp + f(z). 
The OPE of the currents is 



J(z) J(0) = - dip(z) d<p(0) 
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= ^ + 2T{z) + ... (15.157) 

There is no 1/z term, as we would expect since the U(l) algebra is Abelian. 
We see that k = 1, irrespective of whether the boson is an angular variable 
(i.e. is 'compactified'). (In this case, this is somewhat a matter of convention 
because there is no term on the right-hand-side linear in the currents so we 
can rescale J(z) to change k to whatever we wish.) The special case of a 
boson with R = 1 is equivalent to a Dirac fermion, which also has a U{\) 
symmetry, tp — > e td tp, which is promoted to a Kac-Moody symmetry with 
J = ip^ R ipR and J = V/jV'L- 

A more interesting example is afforded by a free boson at R = l/s/2. 
Now, there are additional currents, e^^v 7 ^ _ e ±i(<f>/R+2$R) ^ w j 1 j c ] 1 have di- 
mension (1,0), and e ±ltp ^ = e ±l {4>/R-^4>R) w hich have dimension (0,1). If 
we write J z = idip, = e^ 1 ^^ and similarly for the antiholomorphic cur- 
rents, then these operators have the OPE expected for an SU (2) Kac-Moody 
algebra: 

J+(z) j-(0) = 1 + ^ J z (0) + ... 

J ± (z)J z (0) = ±^J z (0) + ... (15.158) 

Again, the level is k = 1. Such an algebra will be called SU(2)i. More 
generally, an SU(2) Kac-Moody algebra at level k wil be called SU(2)k] 
that associated with an arbitrary Lie algebra G, G^. 

The same algebra arises in a theory of 2 Dirac fermions. We focus on 
the holomorphic part for simplicity: 

/ (15-159) 

where a = 1,2. This action is invariant under the symmetry ip a -,R — > 
Uapipp.ft where WU = 1, i.e. U G SU(2). The associated currents are 

J a = ^a-R^^a;R (15-160) 

where r"g, a = 1,2,3 are Pauli matrices. This can be generalized straigh- 
forwardly to a theory of N Dirac fermions, 



^A-R d ^A-R 



(15.161) 
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where A = 1, 2, . . . , N. This theory is invariant under SU(N) transforma- 
tions, ipAR — ► Uab^b-r with associated currents 

J a =A;R T AB^B;R (15-162) 

and the T^ B s are the generators of the fundamental representation of SU(N). 
They satisfy the SU(N)i Kac- Moody algebra. 

The simplest way of obtaining a Kac-Moody algebra with a level other 
than 1 is to simply take k copies of this theory or, in other words, kN 
Dirac fermions. Such a theory has an SU(kN)i symmetry, but if we focus 
on the SU(N) subgroup of SU(kN), then this algebra realizes an SU(N)k 
Kac-Moody algebra since the level is clearly additive when we take k copies 
of a theory. Note that these fermionic theories all have U(l) Kac-Moody 
algebras as well. 

The bosonic representations of these Kac-Moody algebras are rather in- 
teresting and non-trivial, especially in the k > 1 case. We will return to 
them later. 

Let us now consider the representation theory of Kac-Moody algebras. 
We define a primary field </?( r ) to be a field which transforms in representation 
r of the group G and is primary under the Kac-Moody algebra, 

J Q (z)y (r) (0) = (r) K ' + ... (15.163) 

where the T^s are the matrices representing the generators in representation 
(r). As usual, primary fields create highest weight states, 

|(r),a>=vf r) |0) (15.164) 

Here, we have explicitly written the representation vector index a which 
we suppressed earlier for convenience. The highest weight states form a 
multiplet under the global symmetry generated by the Jq s: 

J$\(r),a}= (T ( « r) ) J(r),/3> (15.165) 

Not all representations (r) are allowed in the theory at level k. For the 
sake of concreteness, we consider the case of SU(2). A restriction on the 
allowed j's in the SU{2)k theory can be found by considering the mode 
expansion of J a , J a (z) = Ylm^rk z ~ m ~ 1 ■ The J^s satisfy the commutation 
relations (15.155). From these commutation relations, we see that I a = Jq 
form an SU(2) Lie algebra. Hence, 2 Jq has integer eigenvalues in any finite- 
dimensional unitary representation. Similarly, I 1 = JJ, I 2 = j£ l5 I 3 = 
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\ k — Jg also form an SU(2) Lie algebra. Consider a spin j highest weight 
state \ j, m = j), with I 3 \j, m = j) = j\j, m = j). Then 

< (j, m = j\I + r \j, m = j) 



= (j,m = j\ 



\j, m = j) 



= (j, m = j\k- 2I 3 \j, m = j) 
= k-2j (15.166) 

Hence, the SU(2)k theory contains the representations j = 1/2, 1, . . . , k/2 
and only these representations. 

Since the I a s form an SU(2) algebra, I 3 must have integer eigenvalues, 
from which we conclude that k must be an integer. 

One remarkable feature of a Kac-Moody algebra is that it automatically 
includes the structure of the Virasoro algebra. Consider T(z) defined by 

T= :J a J a : (15.167) 

k + Ca 

where Ca is the quadratic Casimir in the adjoint representation if the highest 
root is normalized to length 1. This satisfies the Virasoro algebra, 



T(z)T(0) 



1/2 



k + C A 

1/2 
k + C A 

1/2 



k + C A 



: J a (z) J a {z): : J b (0) J b (0): 
x 2 x (J a (z)J a (0)) (>(z)J b (0)) 
x 2 x(^ + ^(0, + ...)x 



+ ... 



k(Y 



ah 



if 



abd 



1/2 



+ 

z" z 
k 2 S ab S ab 



J d (0) + ...+... 



(* + C A f 

1/2 
(* + C A ) 2 

1/2 

(* + C A ) 2 
1 k\G\ 



abc fabd 



f abc f 



-J C J d 



e\o\ 



zed 



Z 4 

k 2 \G\ 



C A 5 cd k5 cd 



2 k + C A z a 
c/2 



with 



k + C A 



(15.168) 
(15.169) 
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where |G| is the dimension of the Lie algebra. One can check that the next 
term in the OPE is as expected for the energy-momentum tensor, so that 

T(z) T(w) = r -^- f + T(w) + — dT(w) + ... (15.170) 

(z — w)^ (z — w) z z — w 

This immediately enables us to compute the dimensions of primary fields. 
According to (15.167), a primary field transforming under G as representa- 
tion r has dimension 

For the case of SU(2)k, this means that a spin j primary field has dimension 
h = j(j + l)/(k + 2). 

The correlation functions of primary fields in a theory with Kac-Moody 
symmetry can be calculated using the Sugawara construction of T(z). Ex- 
panding both sides of (15.167) in modes, the n = — 1 term is: 

L -! = kTc2 ( J - lJS + J - 2Jt + -'^ (15,172) 

We act with this equation on a primary field, using the fact that J® for 
n > annihilates primary fields, together with (15.163). Hence, there is a 
null state given by 

( L -l-]-^ J -l T (r))nr)=V (15-173) 

This null state condition translates into the following differential equation - 
the Knizhnik-Zamolodchikov equation - for correlation functions of primary 
fields: 

(k + C A )^ + Y, Tj T^ } (fir.) (*0 • • • f(r n ) (*0> = (15.174) 

These first-order differential equations can be solved to determine the cor- 
relation functions of primary fields. 

The fact that the T given by the Sugawara construction satisfies the Vira- 
soro algebra does not necessarily imply that it is the full energy-momentum 
tensor of the theory. It may only be the energy-momentum tensor of one 
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sector of the theory; the full energy-momentum tensor is then the sum of sev- 
eral tensors, each of which obeys the Virasoro algebra. The central charges 
add. 

Consider our theory of two chiral Dirac fermions. This theory has both 
U{\) and SU(2)i Kac-Moody symmetries, with 

T Stf(2) = J/! .jaja. (15.175) 

k + 2 y J 

since Ca = 1(1 + 1) = 2; a = 1,2, 3. The central charge associated with the 
SU{2) part of the theory is Cgu(2) = 1- Meanwhile, 

T u ^ = 1 : J J: (15.176) 

The normalization is chosen so that the second term in the OPE of T u ^ 
with itself is 2T/z 2 . As a result, cjjm = 1. The total central charge cgu{2) + 
cum = 2 is the central charge of two Dirac fermions. 

Since the level of the SU(2) Kac-Moody algebra is 1, a spin- 1/2 primary 
field has dimension 1/4. Similarly, a charge-1 primary field under U(l) has 
dimension 1/4. A Dirac fermion carries charge-1 and spin-1/2. Thus, it has 
dimension 1/2, as expected. 

There is not necessarily a unique way of decomposing T into Kac-Moody 
algebras. For instance, consider a theory of kN Dirac fermions, with c = kN. 
This can be decomposed into cjjm = 1 and, from (15.169) with \SU(N)\ = 
N 2 — 1 and Ca = N, csu(kN)! = kN — 1. Alternatively, we can think of our 
kN fermions as k sets of N fermions. There is an SU (N)k symmetry among 
the N fermions (rotating all sets together) and also an SU (k) n symmetry 

among the k sets. c SU (N) k = k+N while c SU (k) N = N +k ■ Thus ' 
c U(i) + c SU{N) k + c su{k) N = kN is an equally good decomposition of the 
theory into Kac-Moody algebras. In the multichannel Kondo problem, with 
N = 2 and k channels, this proves to be a particularly useful decomposition. 

Earlier, we alluded to the bosonic theories which posess Kac-Moody 
symmetry at level k > 1. We now discuss them. These theories are non- 
linear cr-models of Wess-Zumino-Witten (WZW) type. In these theories, 
the basic field, U, takes values in some group G. Since the group G is some 
curved space, a WZW model is an interacting field theory, similar in spirit 
but different in detail to, say, the 0(3) non-linear a model. 

let us consider such a theory on the cylinder S 1 x R. The WZW action 

is: 



Swzw = f tr (fy crW) + 
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— I e^hridMU^dvUU^dxUU- 1 ) (15.177) 
12tt J D 2 xR 

The second term - often called a Wess-Zumino term - appears to be an 
integral over the 3D manifold given by the solid cylinder, D 2 x R, but, 
in fact, it only depends on the boundary values of U. This is completely 
analogous to the Berry phase term which we have in the action of a single 
spin. 

The equation of motion which follows from the WZW model is 

(U-^U - e^U^dvU) = (15.178) 

The second term in the parentheses results from the Wess-Zumino term in 
the action. The equation of motion is only for the restriction of U to the 
boundary, S 1 x R, which is one way of seeing that the action is independent 
of the continuation from S 1 x R to D 2 x R. 

It is useful to rewrite the equation of motion in terms of z,z: 

dljJ^dU) = 

This automatically implies that 

d (dU U' 1 ) = 

since 

8 (U^dU) = U^ddU - U^dU U' 1 dU 

= U- 1 d(dUU- 1 )U- 1 (15.181) 

Thus, we have right- and left-handed currents 

J a = tr (T a U- 1 dU) 

J a = tr (T a dU U- 1 ) (15.182) 

where the T a s are the generators of the Lie algebra. Note the asymmetry 
between the definitions of J a and J° '. 

We can compute the commutators of the modes of J a , J a by first deriving 
the canonical commutation relations from the action: they are those of a 
Kac-Moody algebra at level k. 

15.14 Coulomb Gas 

15.15 Interacting Fermions 



(15.179) 
(15.180) 



15.16 Fusion and Braiding 
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CHAPTER 16 



Mean-Field Theory 



16.1 The Classical Limit of Fermions 

At low-temperatures, there is a canonical guess for the ground state of 
a system of bosons ip, the broken-symmetry state {ip(x)) / 0. In this 
state, the system becomes rather classical. One can define a classical field 
^ c (x) = {ip{x)) which describes the properties of the system at the level 
of the saddle-point approximation to the functional integral. ^f c (x) satis- 
fies differential equations (the saddle-point condition) which can be solved 
in different experimental geometries or with different boundary conditions, 
just like a classical field. 

With fermions we don't have such an option. A fermion field, x, can't 
have an expectation value. Consider the functional integral for (ip) 



Taylor expanding exp(— S), we obtain only terms with an even number 
of Grassman fields. Therefore, the integrand contains only terms with an 
odd number of Grassman fields. Hence, it must vanish. Introducing small 
bosonic (i.e. physical, at least in principle) symmetry-breaking fields cannot 
alter this conclusion. 

This makes fermion problems intrinsically a little more difficult: there 
is no stable phase which is a natural, 'classical' ground state of the system. 




(16.1) 
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This fact is somewhat obscured by the remarkable stability of the Fermi 
liquid critical point which, though it is a critical point, can almost disguise 
itself as a stable phase. In strongly-interacting fermions systems, however, 
the Fermi liquid can be unstable to several different phases. In such a regime, 
perturbation theory about the Fermi liquid critical point is, of course, hope- 
less. A brute force numerical solution is almost certain to run into difficulties 
of the same origin: in many problems, algorithms will converge very slowly 
because they must decide between many possible competing ground states. 

To understand one or another of the stable phases, one is better off 
starting deep within one of these phases. These stable phases fall into sev- 
eral categories. In this part of the book, we focus on phases in which the 
fermions organize themselves into a bosonic degree of freedom which con- 
denses, breaking a symmetry, and producing a 'classical field' describing 
the ground state. In later parts of this book, we will consider two others: 
localized phases in dirty systems and topologically-ordered phases. 

16.2 Order Parameters, Symmetries 

The simplest bosonic degrees of freedom which can arise in fermionic systems 
are composed of fermion bilinears. Broken symmetry states occur when one 
of these bilinears acquires a non-zero expectation value. It is sometimes 
useful to think of these fermion bilinears as annihilation operators for 'bound 
states' formed by two electrons or by an electron and a hole. The broken 
symmetry state is a state in which the 'bound state' condenses. The reason 
for the quotation marks is that there need not actually be a stable bound 
state. In this section, we will discuss some of the possible order parameters 
in a fermionic system. 

Let us first consider superfluid/superconducting order parameters. As 
we discussed in chapter . . . , a superconductor is is characterized by the break- 
ing of the global U(l) symmetry associated with charge conservation (if we 
ignore for the moment the electromagnetic field and issues of gauge invari- 
ance in the superconducting case). In other words, the symmetry 

i> a ^e i6 *p a (16.2) 

must be broken. This occurs when one of the following bilinears acquires an 
expectation value: 

(MWM-k't)) ( 16 - 3 ) 

Focussing for the moment on the spin structure of the order parameter, 
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we see that there are two categories, spin-singlet and spin-triplet superflu- 
ids/super conductors. The former occur when 

(V«(k, t) ^(-k, t)) = *(k) e a/3 (16.4) 

while the latter are characterized by the expectation value 

(Mk, t) M-k, t)) = *(k) • ajeyp (16.5) 

Fermi statistics requires that in a singlet superfluids/superconductor (k) 
must be even in k, while in a triplet superfluid/superconductor, ^(k) must 
be odd in k. 

In order to discuss the k dependence of the functions ^(k), ^(k), we 
must make some further assumptions about the system. For the super- 
fluid/superconducting case, we will consider a translationally- and rotationally- 
invariant system, since this case applies to 3 He, as well as the case of a system 
on a 2D square lattice. For the case of density-wave order parameters, to 
be introduced later in this section, we will focus on the 2D square lattice. 
Other lattices will be considered in the problems at the end of the chapter. 

Let us first consider superfluids in the continuum. A singlet superfluid 
can be thought of as a condensate of spin-singlet fermion pairs which have 
orbital wavefunction ^f(k). In the continuum, an energy eigenstate of such a 
pair will have well-defined I? , L z (let us ignore spin-orbit coupling) , thanks 
to rotational invariance. It is possible for two different pairing states with 
different orbital and/or spin angular momenta to condense. The simplest 
way for this to happen is if a metal has several bands, and the order pa- 
rameters in different bands have different symmetries. It is also possible, 
in principle, for this to occur within a single band. However, two different 
angular momentum eigenstates would somehow have to become degenerate, 
which would require fine-tuning, so we will not discuss this further here (see 
problems, however?). Once we have determined the L 2 eigenvalue, lit + 1), 
there is still freedom in the choice of L z eigenvalue m. Thus, we can focus 
on angular momentum eigenstates of the form: 

(MKt)M-k,t)} = <p(k) ( d m Yi(k)\ e a/3 (16.6) 

\m=-e J 

where I must be even by Fermi statistics. The cases t = 0, 2, 4 are called 
s-wave, d-wave, and g-w&ve superconductors. Since the <i m 's are arbitrary 
complex numbers, the order parameters for these three cases have 2-, 10-, 
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and 18-component order parameters, d m , respectively. f(k) is the radial 
part of the pair wavefunction. 

In order to allow for spatially inhomogeneous situations, such as those 
resulting from boundary conditions of the presence of a magnetic field, we 
should allow the fermion fields to be at arbitrary momenta, not necessarily 
k and — k. 

<^ a (k,t)^(-k / ,*)> = e a/ ^(|k-k'|) x 

/0 eiR(k+k ° ( £ dm(R)Y^)\ (16.7) 

\m=— £ / 

Here, we have Fourier transformed the center-of-mass momentum of the pair 
in order to reveal the possible spatial dependence of d m (R). 

The underlying symmetry of the system is J7(l) x 0(3) x Zj, where 
Zj is the Z2 symmetry of time-reversal (there is also an SU(2) spin sym- 
metry which is unaffected by singlet ordering). This is broken down to 
Z2 x U(l) x Zi x Zj. The order parameter is composed of a product of 
two fermion operators, so the Z2 £ U(l) transformation -ip — > — ip leaves the 
order parameter unchanged. U(l) £ 0(3) is the subgroup of rotations about 
the direction of the angular momentum vector of the pair. Z^ € 0(3) is the 
discrete set of rotations which leave Y r e n (k) invariant. Finally, ... needs to 
be finished 

In a triplet superfluid in the continuum, we have 

(Mk, t) M-k, t)) = <p{k) ( Y K k )dm) ■ ( 16 - 8 ) 

\m=-e ) 

Thus, an I = 1 triplet superfluid in the continuum has an 18-component 
order parameter d m . 

The symmetry group of the system is U(l) x 0(3) x SU(2) x Zj. The 
symmetry-breaking pattern depends on the d m s. In the A phase of 3 He, 
do = d- = 0, while d + = d, for some real vector d: 

{i/j a {k, t) i>p{-k, t)) = <p(k) (k x + iky) d- a a 7 e 7/3 (16.9) 

The angular momentum of the pair thus points in the z-direction (which 
we have arbitrarily chosen, without loss of generality; any other direction 
is equally good). The pairs have spins fj + jj in the d-direction, i.e. the 
total spin of each pair has vanishing component along the ci-direction. (We 
have ignored spin-orbit coupling, which would break independent orbital 
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angular momentum and spin conservation down to total angular momen- 
tum conservation. The spin-orbit interaction of 3 He favors alignment of d 
with the angular momentum.) In this phase, the remaining symmetry is 
U(l) x 17(1), where the two U(l) factors correspond to spin rotations about 
the ^-direction and rotations about the z-axis combined with gauge trans- 
formations (the former yield a phase factor which is cancelled by the latter) . 
Time-reversal is broken by the selection of m = +1, which transforms into 
m = — 1 under T. 

In the B phase, do = z, d± = (x=F iy)/\J~2 or, simply, 

(Mk,t)M-k,t)) = if(k)k-a^e l(3 (16.10) 

The pairs have orbital angular momentum £ = 1 and spin angular momen- 
tum S = 1 which add together to give total angular momentum J = 0. 
Thus, the B phase is invariant under 50(3) x Zj, where the SO (3) is the 
group of simultaneous rotations of both space and spin. 

The forms which we have chosen are actually too restrictive. To be 
completely general, we should allow the fermion operators to be at different 
times t, t' and allow the right-hand-side to have non-trivial dependence on 
t — t'. If it is an odd function of t— t', so that the correlation function actually 
vanishes for t = t', then the order is called odd-gap superconductivity. 

Now, let us consider a system of electrons in 2D on a square lattice of 
side a. The symmetry group of the square lattice is D4, with 8 elements: 4 
rotations, including the identity - by 0, tt/2, tt, 3tt/2 - and 4 reflections - 
through the x-axis, the y-axis, and the lines x = ±y. An s-wave supercon- 
ductor has an order parameter of the form: 

(il> a QL,t)M-k,t)) = *oe a p (16.11) 

Turning to p-w&ve superconductors, we see that the analog of the A 
phase is: 

{i(j a (k,t) ipp(—k,t)) = ^0 (sin k x a + i sin k y a) d-a^e^p (16.12) 

A p x superconductor with vanishing spin component along the <i-direction 
is even simpler: 

(i/} a (k,t)ipj3(-k,t)} = ^0 (sin/^a) d ■ <7 a 7 e 7/3 (16.13) 

Note that the order parameter vanishes along the direction k x = 0. A p y 
superconductors has sin k x a replaced withsinfc y a 
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A d-wave superconductor must be a spin-singlet superconductor. A 
d x 2_ y 2 superconductor has 

(ip a {k, t) ipp(-k, t)) = A (cos k x a - cos k y a) e af3 (16.14) 

while a d xy superconductor has cos k x a — cos k y a replaced by sin k x a sin k y a. 
A d x 2_ y 2 + id xy superconductor breaks T with the order parameter: 

(Va(M)#(-M)> = 

Ao (cos k x a — cos k y a + ismk x a sinfe^a) e Q/ g (16.15) 

We can define analogous neutral order parameters which do not break 
U(l). However, the spin structures will no longer be determined by Fermi 
statistics. Let us first consider the singlet orderings: 

(r\k + Q, t) ipp(k, t)) = $ Q (k) 51 (16.16) 

If 3>q(1c) = 1 or if its integral over k is non-zero, the state is a charge- 
density-wave (CDW): 

{pm = /^(^ + ^)^(M)) = /(02^( k )^° ( 16 - 17 ) 

The triplet orderings are of the form 

(rHk + Q,t)Mk,t)) = $ Q (k) -3% (16.18) 

If ^g(k) = 1 or if its integral over k is non-zero, the state is a spin- density- 
wave (SDW): 

(S(Q)) = /^(^(* + G>t)MM))*g = /^$ Q (k) 

(16.19) 

Let us consider these order parameters in more detail on the square 
lattice. f(k) is an element of some representation of the space group of the 
vector Q in the square lattice. A singlet s-wave density wave is simply a 
charge-density- wave: 

^{k + Q,t)^{k,t)) = ^ Q 81 (16.20) 

In the higher angular momentum cases, we must distinguish commmensurate 
and incommensurate ordering. For commensurate ordering such that 2Q is 
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a reciprocal lattice vector, e.g. Q = (ir/a, 0) or Q = (n/a,n/a), we can take 
the hermitian conjugate of the order parameter: 



^P(k,t) ip a (k + Q,t)J = $q f*(k) 5p 

+ Q + Q, t )i, a (k + Q, t)) = % f*(k) 51 

<S> Q f(k + Q)6% = $* Q f*(k)5% (16.21) 

Therefore, for Q commensurate 

f(k+Q) $h 



f*(k) " $Q 



(16.22) 



Hence, if f{k+Q) = —f*(k), <5q must be imaginary. For singlet p x ordering, 
this will be the case if Q = (7r/a,0) or Q = (ir/a,ir/a). For singlet d x i_ y i 
ordering, this will be the case if Q = (ir/a,7r/a). If f(k + Q) = f*(k), $q 
must be real. For singlet p x ordering, this will be the case if Q = (0,-ir/a). 
For singlet d xy ordering, this will be the case if Q = {it /a, it /a). 
A commensurate singlet p x density-wave state has ordering 



^{k + Q,t)^p{k,t)J = $ QS mk x a6% (16.23) 
The commensurate singlet p x + ip y density-wave states are defined by: 

[ip a] (k + Q,t)ipp(k,t)) = $ Q (sin k x a + i sin k y a) 5% (16.24) 



Further insight into these states is obtained by considering their real- 
space forms. From (16.22), a commensurate singlet p x density-wave state 
with Q = (n/a,0) must have imaginary <£q: 

ip^ a (x, t) ippix + ax, t) — ^ a (x, t) ij)p(x — ax, t) 

... + \§ Q \ e&^Sfi (16.25) 

The singlet state of this type breaks no other symmetries; it is usually called 
the Peierls state or bond order wave. If Q = (0, vr/a), &q must be real. 

ip^ a (x, t) ipp(x + ax, t) — ^ a (x, t) ij)p{x — ax, t) S 

...-i\$ Q \ e^51 (16.26) 

As a result of the i, the Q = (0,ir/a) singlet p x density- wave states break 
T. However, the combination of T and translation by an odd number of 
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lattice spacings remains unbroken. The same is true of the commensurate 
singlet p x + ip y density-wave states. Examples of commensurate and incom- 
mensurate singlet p x and p x + ip y density-wave states are depicted in figure 
??. 

Similarly, the commensurate singlet d x i_ y i density- wave states have 
^ V Qt (k + Q,t) V>/j (M)) = $q (cosk x a- cos k y a) 5% (16.27) 
while the commensurate singlet d x 2_ y 2 + id xy density- wave states have 

(^ Qt (fc + Q,*)^(M)) = 

(cos k x ci — cos kyd + i sin fc^o sin ky(i) 5p (16.28) 

The commensurate Q = (7r/a,7r/a) singlet d x 2_ y 2 density-wave states 
must have imaginary $>q, according to (16.22). In real space, it takes the 
form: 

^ a (x, t) iP[3(x + ax, t)^ + ^ a (x, t) iPptx - ax, t)} - 
^ a (x, t) 4> p (x + ay, t)) + (^ a (x, t) 4>p(x - ay, i)) = 

••• + |l^Q|e^^ (16.29) 

As a result of the i, the singlet d x 2_ y 2 density-wave breaks T as well as 
translational and rotational invariance. The combination of time-reversal 
and a translation by one lattice spacing is preserved by this ordering. The 
commensurate Q = (n/a^/a) singlet d x 2_ y 2 density-wave state is often 
called the staggered flux state. There is also a contribution to this correlation 
function coming from ip'(k)ip(k) which is uniform in space (the . . .); as a 
result, the phase of the above bond correlation function - and, therefore, 
the flux through each plaquette - is alternating. The commensurate Q = 
{n /a, it /a) singlet d xy must have real $q; therefore, it does not break T. 
On the other hand, the singlet d x 2_ y 2 + id xy state does break T. Note that 
the nodeless commensurate singlet d x 2_ y 2 +id xy density- wave state does not 
break more symmetries than the commensurate singlet d x 2_ y 2 density- wave 
state, in contrast to the superconducting case. Examples of singlet d x 2_ y 2, 
d xy , and d x 2_ y 2 + id xy density-wave states are depicted in figure ??. 

The incommensurate cases will be considered in the problems at the end 
of the chapter. 

It is also possible to break spin-rotational invariance without breaking 
translational symmetry or £7(1) gauge symmetry. This is accomplished in a 
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ferromagnet: 

(#*t(M)#(M)) =M-a| 5 (k) (16.30) 
Integrating over k, we have: 

(S)=M (/(0 5(k) ) (16 - 31) 

It is also possible to break spatial rotational symmetries (either in the 
continuum or on the lattice) without breaking translational symmetry or 
spin rotational symmetry: 

(y°*(k,t)1>f,(k,t)) = 8$f(k) (16.32) 

where /(k) transforms non-trivially under rotations. 

16.3 The Hubbard-Stratonovich Transformation 

There is a formal transformation, the Hubbard-Stratonovich transformation, 
which allows us to introduce a bosonic degree of freedom to replace one of 
the possible bilinear combinations of the fermions. When the occurrence of 
such an order parameter causes a gap in the fermionic spectrum, the fermions 
can be integrated out and an effective theory for the bosonic variable can 
be derived. This theory can then be solved by the usual methods applied 
to bosonic systems, such as the saddle-point approximation. This approach 
has a good chance of succeeding when one of the possible bosonic degrees of 
freedom dominates, in particular, when it condenses and the others remain 
gapped. By studying the instabilities of the RG flows of a Fermi liquid in a 
periodic potential, it is sometimes possible to detect such a tendency. This 
approximation can often become exact in some kind of large- N limit. 

To illustrate this transformation, let us consider the following integral 
over 2n Grassmann variables Xa, Xa'- 

1 = j\\dxadXa e*-*» + sr(x«Xa) 2 (16.33) 

We can rewrite this integral as 



/ = M J ]T d Xa dxa d<pe XaX «-v XaX *-^ 2 



(16.34) 
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where N is a normalization constant. By performing the Gaussian integral 
over ip, we recover the integral (16.33). This, in a nutshell, is the Hubbard- 
Stratonovich transformation; we will be applying it to functional integrals. 
The Grassmann integrals are now Gaussian, and they may be performed: 

I = AT J d<p(l-<p) n e-^ 2 
= M j dipe nln(1 '^-^ 2 (16.35) 

Thus, we have exchanged a non-Gaussian Grassmann integral for a non- 
Gaussian ordinary integral. This may not seem like such a big success. 
However, we can now use techniques such as the saddle-point approximation 
to evaluate the ordinary integral. 

Before doing this, however, note that we could have decoupled the quartic 
term in another way: We can rewrite this integral as 

^(XaXaf = e ^k(XaXa)(XbXb) 

= M j d(p ab eXoVakXi-S^i (16.36) 
Now the Grassmann integral is transformed into an integral over the matrix 

I = M j dip ab e n HSab-Vat,)-^ ab (i 6 .37) 

The two integrals (16.35) and (16.37) are equal so long as they are per- 
formed exactly. However, different approximations are suggested by the 
forms of these integrals. In the analogous field-theoretic context, the under- 
lying physics will dictate which one is a better starting point. 

16.4 The Hartree and Fock Approximations 

Let us now consider the saddle-point evaluation of these integrals. The 
saddle-point condition for (16.35) is: 

-ip-^— (16.38) 

The two saddle-point values of (p are the golden number and the negative 
of its inverse, <p s .p. = (1 ± v / 5)/2. Adding the contributions from both 
saddle-points, we have 

/ = M { ( - 1)n , e -H^)+" K^H 
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+ H) " /9 e-< 1+ ^) +n K^) } (16.39) 

[(^5-l)/2] n/2 

In the physical context in which we will be using these ideas, we will 
be considering Grassmann functional integrals which can be rewritten as 
bosonic functional integrals - or 'decoupled', since the remaining Grassman 
integral is Gaussian - in a variety of ways. The different bosonic fields 
which we introduce will be the different possible order parameters of the 
system. Mean-field-theory - or the Hartree approximation - for any of these 
order parameters is simply the saddle-point approximation for the bosonic 
functional integral. The saddle-point condition is then a 'gap equation' (a 
name whose aptness will become clear when we look at examples). 

This approach can also be used to decouple an interaction which does 
not lead to the development of a non-trivial order parameter. For instance, 
a density-density interaction can be decoupled so that the electrons interact 
with a bosonic field. At the saddle-point level, the bosonic field is equal to 
the electron density. This is the Hartree approximation. Alternatively, the 
interaction can be decoupled so that the bosonic field is equal to the product 
of a creation operator from one density factor and an annihilation operator 
from the other density factor. This is the Fock approximation. 

16.5 The Variational Approach 

There is an equivalent approach within the framework of canonical quan- 
tization. One introduces a trial ground state, |0), which is based on the 
anticipated order parameter. The size of the order parameter is the varia- 
tional parameter which is tuned to minimize 

The condition for minimizing (H) is the same as the saddle-point condition 
in the path integral approach. 

The basic form of the trial wavefunction is, in the superconducting case: 

i*o>= n(^ + ^*T^-*j i°) ( ie - 41 ) 

k 

with where the u^s and v^s are variational parameters with respect to 
which ($!q\H |\Po) is minimized. The wavefunction is normalized by taking 

4 + 4 = 1 (16.42) 
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Thus, there is only one free variational parameter for each k. The combi- 
nation Ukfk is a convenient way of parametrizing it. Calculating the order 
parameter in (16.41), we see that it is given by u^v^. Thus, the variational 
method selects a ground state by minimizing the energy with respect to the 
order parameter, while the saddle-point approximation of the previous sec- 
tion finds a ground state by minimizing the action with respect to the order 
parameter. 

In the case of a density- wave at wavevector Q, the trial wavefunction 
takes the form: 




(16.43) 



k 



CHAPTER 17 



Superconductivity 



17.1 Instabilities of the Fermi Liquid 

When a fixed point has a relevant perturbation, this perturbation generally 
leads to a fundamental reorganization of the ground state. We saw a trivial 
example of this with a shift of the chemical potential of a Fermi liquid. When 
the instability is due to interaction terms, the general strategy is to use the 
RG to go to low energies so that the irrelevant variables have all become 
small and the relevant variable is dominant. The problem with a single 
relevant interaction must then be solved by finding a new saddle-point (i.e. 
mean field theory), the variational method, or some other non-perturbative 
method. This approach has proven very successful in the study of ordering 
in condensed matter physics. (Sometimes, there are competing instabilities 
in which case it is very difficult to find a new saddle-point or an appropriate 
variational ansatz. This occurs in the case of a ID system of fermions.) In 
the case of electrons in a solid, the Fermi surface need not be rotationally 
symmetric, and spin- and charge-density wave instabilities are possible when 
the Fermi surface satisfies certain special conditions ('nesting'). If the Fermi 
surface is rotationally symmetric, there is only one instability, as we found 
earlier: the Cooper pairing instability. 

Consider the action of electrons in D = 2 with F = but non-zero V, 




d 2 k de 
(2vr) 2 2^ 



ipl(e, k) (ie - v F k) ip a (e, k) 
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- 1 ^ ir if if ^ + (^' ^o^lces, -^) ^(^, fc') vt(^, -^^(^) 

where V(k, k') = V(9\ — # 2 ) is a function of the angles only. Unlike in 
previous chapters, where we dealt with spinless fermions, we now consider 
spin 1/2 electrons. 

In chapter 14, we showed that the Fourier modes of V{9\ — 6> 2 ) satisfy 
the RG equation: 

dt 2irv F m [ ' 

When negative, these are relevant. In the next section, we will find the new 
saddle point which is appropriate for the case in which V is relevant. We 
will also mention briefly the equivalent variational ansatz (which was the 
historical method of solution). 

17.2 Saddle-Point Approximation 

We introduce a Hubbard-Stratonovich field ^f(k,uj) to decouple the BCS 
interaction: 

S = J (2^2 Itt^* 6 ' ^ ^ £ ~ VF ^ ^ 

+^{e 1 ,k')^ l {e 2 , —k') ^(ei, e 2 , k) + * f (ei + e 2 , fc)*(«i + e 2 , <|f.3) 
We now make the change of variables: 

A(ei + e 2 , k) = J ^ V(k, k') *t( Cl + C2) fc') (17.4) 

Then, the action can be rewritten: 

- y (2^ w? ^ [ ^ (ei! fc>!(e2 ' - *° A(ei + e2 ' fe) 

+^ T (€i, fc>i(e 2 , —A;') At( ei + e 2 , fc) + At( Cl + e 2 , fc)^*, fc')A( ei + <&fc§] 
where V~ 1 (k, k!) is the inverse of V(k, k'): 

I -0j2 V ~ 1 ^ k ') V (k',k") = 5(k-k'') (17.6) 
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Since the action is quadratic in the fermion fields we can integrate out 
the fermions to get an effective action 5 [A]: 

S[A] = - Trlii {{ief - (v F k) 2 - \A{k)\ 2 ) + 

<lH ' ' n ' de A^e,k)V-\k,k')A(e,k') 



I 



(2vr) 2 (2tt) 2 2tt 
/(2^2^ ln ^ )2 - ( ^ )2 - |AW|2 ) + 



/ 



d'i <H J *1 A t (ejfc ) F -i (A . )fc /) A(ejfc /) (17.7) 



(2tt) 2 (2tt) 2 2^ 



We look for a frequency-independent solution, A(e, k) = A(k) of the saddle 
point equations, 

g = (17.8) 
From (17.7), we have the saddle-point equations: 



9) 



S Wf^ {ief - (v F kf - \A{kW ~ I W? (17 ' 

At zero-temperature, the e integral in the first term can be done (at finite- 
temperature, we must do a Matsubara sum instead), giving: 

/l?7^W^/(W F "' (MW) (mo) 



or 



\p< ntmm =A {k) (mi) 



(2^) 2 V^FfcO' + IA^')! 2 

The is the BCS gap equation. It determines the order parameter A which 
breaks the U(l) symmetry A — > e %e A of the action (17.7). 

For V attractive, i.e. V > 0, this equation always has a solution. Con- 
sider the simplest case, of an s-wave attraction, V(k, k') = V. Then the gap 
equation reads: 

d 2 k' VA . , 1W10 , 

A (17.12) 



/ 



(2vr) 2 ^{v F k'Y + \Aj 
or, 



d 2 k' 11 , 

(17.13) 



(2tt) 2 ^( VF k>y + \A\ 2 V 
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Since the left-hand-side is logarithmically divergent at the Fermi surface if 
A = 0, there is always a non-trivial saddle-point solution when V > 0. 



m* f 
2W 



d£ . 1 = -*- (17.14) 



or 

A = ^— (17.15) 

bum m „ v 

If the attraction is weak, m*V /2-k <C 1, then 

2"7T 

A = 2Ae"~ (17.16) 

Note that the gap is not analytic in V; it could never be discovered in 
perturbation theory. 

As you will show in the problem set, the finite-temperature gap equation 

is: 

d 2 k' V{k,k')A(k') u (3E k , 



I 



(2vr) 2 y /( VF k')* + \A(k')\ 2 
with 



tanh — — ^- = A(k) (17.17) 



E k = ^/(v F k) 2 + \A(k)\2 (17.18) 
For an s-wave attraction, this gap equation has solution A = when: 

£ /e^-Ltam4 = i (17.19) 

2^ J 2 V K ' 

So the critical temperature for the onset of superconductivity is: 

2-7T 

r c = 1.14Ae~^v (17.20) 



17.3 BCS Variational Wavefunction 

For purposes of comparison, consider the route taken by Bardeen, Cooper, 
and Schrieffer. They wrote down the wavefunction 

\*o) = H(u k + v k il>l^- ki ) |0> (17.21) 
k 

with where the u k s and v k s are variational parameters with respect to 
which ($o| H\*Sfo) is minimized. The wavefunction is normalized by taking 



u l + v t = 1 



(17.22) 
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For notational simplicity, we assume that the k's are discrete (as they are 
in a finite-size system). The Hamiltonian which follows from (17.1) is: 

H = E&^iUfcr " E F ( fc > k ')^-kl^-k>l (17-23) 

h h h> 

This Hamiltonian is called the BCS reduced Hamiltonian. It is the Hamil- 
tonian which only contains the relevant interaction. The irrelevant and 
marginal interactions have been dropped. The expectation value of the 
Hamiltonian is: 

<*o| H |* ) = YJ 2 ^ ~ Y, V ( k > k')u k v k Uk>v k > (17.24) 

h h h' 

Hence, 



k /„/ 



UkUk'Vk' + 2- — VkUk'Vw 



= Avktk ~ 2J2v(k, k') (^S—Hk\ Uk , Vk , (17.25) 
k> \ u k J 

The minimum of (^q \ H \ occurs when 

^kUkVk = J2 v ( k , k ') ( u l ~ v l) u k'V k ' (17.26) 



If we define A(fc) by 



or, equivalently, 



u k v k = . A(fc) (17.27) 



2^ + |A(fc)| 



with 



£ fe = ^ + |A(A:)| 2 ( i 7 . 29) 
Then we can rewrite the minimization condition as the BCS gap equation: 

Yy{k, k') A(fc0 = A(k) (17.30) 
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(a) 



(b) 



(c) 



Figure 17.1: The graphical representation of (a) Q (b) T and (c) . 

17 A Condensate fraction and superfluid density** 

17.5 Single-Particle Properties of a Superconduc- 
tor 

17.5.1 Green Functions 

When A takes a non-zero, frequency-independent value, the action for the 
fermions is: 

/d 2 k de 

- 4>\(ie,k')i/j\(-ie,-k') A(fc) - ^ T (ie, k'^-ie, -k') A^tpl) 

As usual, the propagator is obtained by inverting the quadratic part of the 
action. This is now a matrix, with an inverse which gives 

A.tr-- r * e + ^ 



Q aa ,{ie,k) = (ipl(ie,k)ip a i(ie,k)) =5, 



(ie)2-3-|A(A0P 
F aa ,{ie,k) = (ij) a (ie,k)il) a i(-i€,-k)) = e acr > — i l7.32) 

We denote G(ie, k) by a line with two arrows pointing in the same direction. 
We denote J-(ie, k) by a line with two arrows pointing away from each other 

and J^(ie, k) = (^liie, k)ip^,(—ie, —k)^ by a line with two arrows pointing 
towards each other. The electron spectral function is given by 



A(k,e) = Im 



e + 6 



e + i5f -e k ~\Mk)\ 2 , 
u 2 k S(e-E k ) + v 2 k 5(e + E k ) (17.33) 
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which shows that the electron has spectral weight u\ at E k and spectral 
weight v\ at — E k . 

Another way of understanding the single-particle properties of a super- 
conductor is to diagonalize the action. The action is diagonalized by the 

7(fc)'s 

7 T (/c,e) = u k ^(k,e) - v k ip\(-k,e) 

7|(fc,e) = u k ^i(k,e) + v k ip\(-k, e) (17.34) 

/d 2 k df 

— 4 (k, e) (ie - E k ) la (k, e) (17.35) 

The 7(/c)'s have propagator: 

U( i e,k) 7a ,(ze,k)) = -^— (17.36) 

The 7(/c)'s are the basic single-particle excitations - 'Bogoliubov-DeGennes 
quasiparticles' - of a superconductor; they are superpositions of fermions 
and holes. In the case of electrons, the basic excitations have indefinite 
charge, since they are a superposition of an electron and a hole. Although 
they are not charge eigenstates, they are spin eigenstates. 

Note that E k > 0. When £ k » A, u k — > 1, v k — > 0, so jt(k) creates a 
fermion above the Fermi surface, costing positive energy. When <C —A, 
u k — > 0, v k — > 1, so Tcr{k) creates a hole below the Fermi surface, also costing 
positive energy. 

For some purposes - such as the Hebel-Slichter peak in NMR - we can 
ignore the fact that they are a superposition of an electron and a hole and 
treat the superconductor as a semiconductor with energy bands ±E k . Since 
the density of single quasiparticle states, 

^ = £- , lEl e(\E\-A) (17.37) 
dE 2vr yjE 2 - A 2 

is divergent for \E\ — > A and vanishing for < A, the semiconductor 
model predicts sharp increases in these quantities for T ~ A and exponen- 
tial decay for T <C A. However, for other properties - such as the acoustic 
attenuation - the mixing between between electron and hole state ('coher- 
ence factors') is important. The coherence factors can cancel the density of 
states divergence at \E\ — > A, and there is no enhancement for T ~ A. 
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Figure 17.2: The two diagrams which contribute to the spin-spin correlation 
function of a superconductor. 



17.5.2 NMR Relaxation Rate 

According to (6.93), the NMR relaxation rate is given by: 

- / 

T, 



k = I^ Mq) ^ Uq ' W) <17 - 38> 



The spin-spin correlation function (S + (q,iu> m ) S-(—q, —iuj m )) is given by 
the sum of the two diagrams of figure 17.2. Assuming that A(k) = A, this 
is: 



(S+(q,iw m )S-(-q,-iu m )) = ~22 J j^-^ Gu(ie n , k) Q^{ie n + iu m , k + q) 

1 . f d?k 

+ J J^3 :F n (ie n ,k)J : 'l^ie n + iuJ m ,k + q) 



or, 

,o i ■ \ c i ■ w Isr^ [ d3 k ie n +£,k ie n + iuj m + £ k+q 

{ S + (q, i u m )S-(-q,- i u m )) = -Y:j _ ( . £b)2 _^_ |a|2 (i£n + iWm)2 _^_ |A|2 

d 3 k A A 



P^J (2VT)3 ( ien )2_£2_| A | 2 ( jen + iwWl )2_^_| A |2 

(17.40) 

If we replace the sums over Matsubara frequencies by contour integrals which 
avoid z = (2n + l)ivi/(3, 

I dz m f d 3 k z + £ k z + iuj m + £ k+q 



T c 27ri ry, j (2ky { Z y-e k -\&\ 2 (z+ium) 2 -e k+q -\&\ 2 

f dz f _d?k A A 

+ f c 2^ nF{z) J (2^)3 ( Z y -e k ~ |A| 2 (z + iu m ? - H +q ~ |A| 2 
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(17.41) 

these integrals receive contributions only from the poles at 



Z = ±y/% + |AP 



z = -^ n ±^ +9 + |A| 2 (17.42) 

Hence, 

,q i ■ \ q t ■ w f d 3 k i k + E k E k +iu m + £ k+q 

{S + {q,iLJ m ) S-(-q,-iu; m )) = / -—3 n F {E k ) — — . 

J (27r)- J 2E k (E k + iuj m y - El 



f d?k £ k - E k -E k + ibJ m + 6c+g 

+ J (2n)3 nF[ ^ -2E k (-E k + iu m )*-El +q 
f d 3 k ^ k+q + E k+q E k+q — im m + £ k 

+ J Wf^^^E^ (^3—0^2 
/" ri 3 fc — -^fc+g —E k + q — iuJ m + gfc 

"7 (2vr)3 nM ^ j -2£ fc+9 (£ fe+? + *u; m ) 2 -£ 2 

/• d 3 k ^ A 

J (2*f nFW 2E k (E k + iu m r-El 



d 3 k 

+ J J2vf nF[ ~ Ek) ^2E k (-E k + uj m y - El +q 
d 3 k ,„ s A A 



f d 6 k _A A 

+ J (27T)3 "M^W 2^, +g (£ fe+? - Z Wm )2 - E| 

If we now take iuj m — > w + i<5, and w < 2A (and, thereby, dropping terms 
such as 5{oj — E k — E k+q ) which vanish for uj < 2 A), we obtain: 

X + MM = J J^ME k ) S(u + E k -E k+q ) 

~ I As *F(-E k ) to - Ek) { ~ Ek + M §{u + Ek _ Ek+q) 
J (2tt) 3 M k> 2E k E k+q y k+q > 

, f d3k n (p , {j k+q + E k+q ){E k+q + i k ) 
+ J J^ nF ^ Ek ^ 2£^ 5(u + E k -E k+q ) 

f d3k r, ( F \ (g fc+g - E k+q ) (-E k+q + &) , , 
'J J2v? F{ k+q) ^E k S(u + E k+q -E k ) 

/d 3 k A 2 
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f drk A 2 

-J J^ nF{ - Ek) mE^ 6{uJ + Ek - Ek ^ 

f d 3 k A 2 
+ J Wf nF{Ek+ « ] 2E^E k 5{UJ + Ek ~ Ek ^ 

/d s k A 2 
nFi ~ Ek+q) V^E k ^ + Ek ^ 

dropping terms which are odd in or ^ k+q , an d using tif{— E k ) = 1 — 
nF{Ek), we have: 

x'l-(^) = / " n F {E k+q )) (l + C f E + k % + k ^ 2 ) S(u + E k -^4|) 

Let us assume that A(q) = A. Then, dropping the term linear in £ k and 

1 4 f d 3 k' d 3 k , , , , u / A 2 \ 

sir = « J w p " " F(Ei,)) + ) a(u,+ £ ' (i7fl * ) 

or, using the single-particle density of states to re-write the momentum 
integrals as energy integrals, 

i _ a (nf_\ 2 r A dE r A dE , E E' f A 2 \ 

T X T~ u\2k) J a J A V^ 2 - A 2 V^' 2 - A 2 V + 2EE'J 

x (tif{E) — n^-E 1 ')) S(uj + E(y7ELTi) 

or 

J- = 4 f^V f A dE - E E + » (l + —^—) 

T{T u\2ir) J a V^ 2 -A 2 ^/{E + uf- A 2 V 2E{E + u) ) 

x (n F (E) - n F (E + uj)) (17.48) 

For w — > 0, we can write this as: 

1 /m*\ 2 f A E E + uj ( A 2 \ d 

TYT ~ A \ 2^) J A dE 7W=W ^(E + .y-A* V + 2E(E + U )) dE ME) 

(17.49) 

For T — > 0, the right-hand-side is exponentially suppressed as a result of the 
dn F (E)/dE, and 

~ (17.50) 
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For T ~ A, the exponential suppression is not very strong so the density of 
states divergence is important. In fact, for u> = 

lim J- = A ( — V tdE—^— ( 1 + ^r) ^-n F (E) (17.51) 
u,^o TiT V 27r / J A E 2 - A 2 \ IE 2 ) dE y M ; 

which is a divergent integral at E = A. For realistic values of u, there 
is a moderate, but clearly observable increase of \jT\ for T < T c with a 
maximum which is called the Hebel-Slichter peak. 



17.5.3 Acoustic Attenuation Rate 

Suppose we compute the acoustic attenuation rate, which is essentially the 
phonon lifetime. Phonons are coupled to the electron density, so the phonon 
lifetime is determined by a density-density correlation function. This, too, 
is given by the diagrams of figure 17.2. However, since there are density 
operators rather than spin operators at the vertices of these diagrams, there 
is a crucial minus sign arising from the ordering of the electron operators: 

1 x f d? k 

{p(q,iv m ) p(-q,-iu m )} = J Gllfan, k) Q^{ie n + iu m , k + q) 

J ^^^ni^^^iien + iuj^k + q) 

(17.52) 

The acoustic attenuation rate, a, of a phonon of frequency u is essentially 
given by 



a 



where g{q) is the electron-phonon coupling. From our calculation of l/T±, 
we see that this is (assuming constant g): 

fm*\ 2 f A irn E E + uj ( A 2 \ d 

(17.54) 

As a result of the —sign, we can take the to — ► limit: 
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as B ' (B) <17 ' 55> 

As in the case of 1/Ti, this is exponentially decreasing at low, T, 

a~e^r (17.56) 

However, the density of states divergence has been cancelled by the quantum 
interference between particles and holes, so there is no enhancement for 
T ~ A. Since the underlying quasiparticles are a superposition of electrons 
and holes such that their charge vanishes as the Fermi surface is approached, 
their contribution to the density-density correlation function is suppressed. 
This suppression cancels the divergence in the density of states. On the 
other hand, the quasiparticles carry spin 1/2 (since they are a mixture of an 
up-spin electron and a down-spin hole) so their contribution to the spin-spin 
correlation function is unsuppressed; hence the density of states divergence 
has dramatic consequences leading to the Hebel-Slichter peak. 



17.5.4 Tunneling 

Tunneling is a classic probe of the single-particle properties of an electron 
system. Let us suppose we connect a superconductor on the left with another 
system - which may or may not be a superconductor - on the right. An 
approximate description of the coupling between the superconductor and 
the other system is given by the tunneling Hamiltonian: 



d^k d 3 k' r i 

7^)3 7^)3 [*(*> k ') xAk) + t*(k, k') Xa {k) ri(k) 

= B + B ] (17.57) 



where ipt{k) is the creation operator for an electron in the superconduc- 
tor and x<j{k) is the creation operator for an electron in the other system. 
t(k,k') is the tunneling matrix element for an electron of momentum k in 
the superconductor to tunnelin into a momentum k! state in the other sys- 
tem. Tunneling occurs when there is a voltage difference, V, between the 
superconductor and the other system, 

Hy = V J ^L^(k)Mk) 

= VN L (17.58) 
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The current flowing between the superconductor and the other system is 
d 3 k d 3 k' 



I = i 



I 



(2vr) 3 (2tt) 3 
= i(B- B^ 



t(k, k') ^(k) Xka ~ t*(k, k') X a(k) ^ a (k) 



(17.59) 



Following the steps by which we derived the conductivity and other response 
functions in chapter 7, we see that the current, I(t) computed to linear order 
in Ht is given by: 



(/(*)) = (T {e iVtN ^-~ HT } I(t)T[e- tVtNL ~ l ^ HT ] 



Ht), [ 



iVtN L 



(17.60) 



Substituting the above expressions for / and Ht, we have: 



(i(t)) = rdt'e(t-t') { 

J — oo 

fOO 



e ieV(t'-t) i 



- e ieV(t-t') i 



Bt(t),B(f)])} 



+ 



B(t),Bt(0 

I" di! 6(t - t') { e ~-W) i ( [B(t), B(t')] ) - e -W) i ( [Bt(t), B\t')] )} 

(17.61) 



Suppose that t(k, k') = t. Then the real part of the current is 

r 2t \ f d 3 k d 3 k' ^„ „ ^ „ 1 
1 / JZk) 3 J^tt) 3 £-J*L{k, ie n )g R (k, ie n - lu) > 

+ t 2 lmle 2ieVt I S^^^L(k,ie n )^ R (k,ie n -ico)\ 



(2tt) 3 (2^) 3 



(17.62) 



Converting the Matsubara sum in the first term to an integral, analytically 
continuing, and taking the imaginary part (as we have done so often before) , 
we have: 

/d 3 k d 3 k' f°° de 
J^f (2^)3 J_Jhi Al ^' C + eV) ARik '' e) [UF ^ ~ nF(e + &V)] 



- /" h»{< 2 " VI I 7^ ^4 ^L(k,ie n )^ R (k,ie n - ico) 



(2vr) 3 (2tt) s 



iu— >i<5 

(17.63) 
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Let us first focus on the first term. We will call this current Ie since it 
results from the tunneling of electrons. It can be rewritten as: 



2 m* L k F m* R k F 

Ie = t 



. j _ ^ 2 _ 2 / #k / / ^ [«fc ^ + ^ " £*) + ^ <5(e + eV + E k )] 

x A R (k',e)[n F (e)-n F (e + eV)} (17.64) 



Suppose the system on the left is a Fermi liquid, with 

A R (k/,e) = 6{e-&) (17.65) 

Then, 

Ie _ (2 ^ «^J~ iE _E_ [nr{E) _ ^ _ evm m) 

For T = 0, this vanishes for < A and asymptotes / oc V for V large. For 
T finite, / is exponentially small for V < A. If the system on the right is 
also a superocnductor, we find: 

lE = t2 ^ ^ I dE % > = > ~ nrlFVim 



^ 2 ~ A| y/(V - Ef - A| 



This is exponentially small (vanishing at T = 0) for eV < A^ + A#. 

The current Ie resulting from the tunneling of electrons can be under- 
stood in terms of the semiconductor. However, the current described by the 
second term in (17.63) cannot. It vanishes unless the system on the right is 
a superconductor. We call this current Ij, since it was first discovered by 
Josephson. 

I j = t 2 Im |e 2 ^ / 70 £^ ie n )F R (k, *n ~ iu>) } 

(17.68) 

This is one of the few cases in which it is advantageous to do the momentum 
integrals first. Let us assume that |A^| = |A^| = A, A^ = A^e*^ and 



= 2 / m*k F \ 2 f e2teVt ttA l ttA r 1 
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= t 2 (^) 2 Im{e—*(-A) 2 ^tanh^ 

= t 2 (^-j ||A|tonh^J sm(2eVt + <t>) (17.69) 

The Josephson current results from the tunneling of pairs between two super- 
conductors. A DC voltage V leads to an AC Josephson current at frequency 
2eV. Even if the voltage difference is zero, there will be a DC Josephson 
current if the superconducting order parameters onthe left and right have 
different phases. (The flow of this DC current will feed back into the elec- 
trostatics of the problem and, eventually, turn off this current.) 

The Josephson current cannot be understood with the semiconductor 
model since it is due to the tunneling of pairs. It can be understood as an 
oscillation of the phase difference between two superconductors. As such, it 
is an example of a collective mode in a superocnductor. 



17.6 Collective Modes of a Superconductor 

If we expand the effective action (17.7) in powers of and its gradients, 
and include the action of the electromagnetic field, then we have: 

S = Jdt d 3 x (tft (i dt _ Ao ) * + ^\(idi - A<) *| 2 + V(\*\) + ± {E 2 (IBfy) 

V(|\l/|) is actually a complicated function, but let us, for the sake of simplic- 
ity, approximate it by: 

U(|*|)=a(|*| 2 - Ps ) 2 (17.71) 

for some constants a and p s . This action is very similar to our effective 
action for 4 He: the £7(1) symmetry ^ — > e ld ^ is broken when ^ has an 
expectation value. The principal difference is the electromagnetic field. 
Following our analysis of 4 He, we write: 

$ = V(Ps + Sp) e ie (17.72) 
We can rewrite the action as: 

S = ! drd d x (^d T 5p + Ps (d t 9 + A ) + 5p (d t 9 + Aq) 
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+ ±-{E 2 -B 2 )) (17.73) 

The first two terms can (naively) be neglected since they are total deriva- 
tives, so the free part of this action is 

S = J drd d x (5p (d t 6 + A)) + ( V6 P) 2 + Ps fio ~ + a5p 2 
+ -L( E 2 -B 2 )) (17.74) 

Let us take the gauge 6 = 0. Then we have: 

S = f dTd d x ( + -!- (V5p) 2 + a5p 2 
+ (y-E-5p) A 

+ Ps A 2 -±-(vxa) 2 ) (17.75) 

From the third line, we see that the transverse electromagnetic field now 
aquires a gap. Its equation of motion is: 

V 2 A = p s A (17.76) 

which has solutions: 

A(x) = A(0) e~ Xx (17.77) 

where A 2 = l/p s . This is the Meissner effect: the magnetic field vanishes in 
the interior of a superconductor. The action (17.75) also implies the London 
equation: 

j=™ =Ps A (17.78) 
SA 

from which the infinite conductivity of a superconductor follows. 

Although the U(l) symmetry has been broken, there is no Goldstone 
boson. The would-be Goldstone boson, 8, has been gauged away. To put 
this more physically, the Goldstone mode would be an oscillation of the 
density. However, as we saw in chapter 15, the Coulomb interaction pushes 
the density oscllation up to a high frequency, the plasma frequency. Hence, 
the would-be Goldstone boson is at the plasma frequency. 
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From the first term in (17.74), we see that bp and are canonical con- 
jugates and the Hamiltonian is: 

H = Jd d k (JJ- k 2 + a) \5 Pk \ 2 + (A )_ k S Pk + Ps k 2 \6 k \ 2 + i- (E 2 H(]B?^ 

From the constraint (the ^4o equation of motion) , 

k 2 (A )_ k = 5 Pk (17.80) 

we have: 

(4>)_ fc = p*P* (17.81) 

Neglecting the magnetic field, since all velocities are much smaller than the 
speed of light, we have: 

H = J d d k ((^ 2 + a + \bpk? + Ps k 2 \e k \ 2 ^j (17.82) 

Since 8p and are canonical conjugates, this is of the harmonic osciallator 
form 

H = j d d k (J^\Pk\ 2 + l -mu 2 \X k \ 2 ^ (17.83) 

with 



w * = y 4 (p« fc2 )(^ + a + 2^ fc2 ) (17 - 84) 

In the long-wavelength limit, A; — > 0, 

Wfc = v 7 ^ (17.85) 

i.e. the mode is gapped. 
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17.7 The Higgs Boson 

17.8 Broken gauge symmetry** 

17.9 The Josephson Effect-xxx 

17.10 Response Functions of a Superconductor- 



17.11 Repulsive Interactions 

In any real metal, there is a large repulsive force due to Coulomb infractions. 
This repulsion is much stronger than the weak attraction due to the exchange 
of phonons, so one might wonder how superconductivity can occur at all. 
The answer is that the repulsive interaction occurs at short time scales and 
high-energies. At the low energies at which superconductivity occurs, the 
repulsion is much weaker. Since a repulsive interaction in the BCS channel is 
marginally irrelevant, as we saw earlier, it will be logarithmically suppressed. 
Consider the following illustrative model: 



with V > and V — V a > so that the interaction is repulsive everywhere, 
but less repulsive near the Fermi surface - i.e. — V a is the weak attraction 
on top of the repulsion V. Let 



XXX 



V(k,k') 




A(k) 



{ 



Ai if lo d < \€ k \ < A or |£ fe /| > lo d 
A 2 if |£fc| < lo d 



The gap equation is: 



Ax = — FAi nL 

2tt 



Ai = — VAi ^— 




1 



1 




-(V-V a )A 2 — 



f-UJ D I 



v / FTAf 



1 



If we assume that A 3> u>d and wp> A2 then we have: 



If . m* , / A\ . 777<* / ujd\ 

A 1 =-FA 1 -l I ^-j-KA 2 -ln(-j 
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From the first equation, we have: 

Ai = A 2 £ In ( (17.88) 



Hence, Ai and A 2 must have opposite signs. Substituting into the second 
equation, we find: 



v.- v 



*2 



This equation will have a solution if 



V a V , k s > ( 17 - 9 °) 



1 + — 



-yinf^) 

T ) 



even if 14 — V < 0. In other words, the bare interaction may be repulsive, 
but the effective pairing interaction can be attractive because the repulsive 
part will be logarithmically suppressed. 

17.12 Phonon-Mediated Superconductivity-xxx 

17.13 The Vortex State*** 

17.14 Fluctuation effects*** 

17.15 Condensation in a non-zero angular momen- 
tum state*** 

17.15.1 Liquid 3 He*** 

17.15.2 Cuprate superconductors*** 

17.16 Experimental techniques*** 
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CHAPTER 18 



Density waves in solids 



18.1 Spin density wave 

Much of the formalism which we used in the previous chapter can be adapted 
to the case of density- waves in fermion systems with nested or nearly nested 
Fermi surfaces. 

18.2 Charge density wave*** 

18.3 Density waves with non-trivial angular momentum- 

xxx 

18.4 Incommensurate density waves*** 
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Part VI 



Gauge Fields and 
Practionalization 
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CHAPTER 19 



Topology, Braiding Statistics, and Gauge Fields 



19.1 The Aharonov-Bohm effect 

As we have discussed, systems of many particles tend to form energy gaps as 
a way of lowering their energy. One might be tempted to conclude that their 
low-energy properties result, trivial, and that interesting physics 

occurs only when they are gapless, either because they are tuned to a crit- 
ical point or because their ground state spontaneously breaks a symmetry. 
However, non-trivial low-energy physics can occur even when a system is 
fully gapped. A fully gapped system can have non-trivial topological prop- 
erties, which do not require low-energy local degrees of freedom. As we 
will see, such properties can be described by gauge fields. These topolog- 
ical properties are concomitant with the phenomenon of fractionalization, 
whereby the quantum numbers of the low-energy excitations of a system can 
be fractions of the quantum numbers of its basic microscopic constituents, 
presumably electrons. Phases which are characterized by fractionalization 
are stable against small perturbations: if the electron breaks into n pieces, 
a small perturbation cannot change this continuously; an electron, unlike 
the average American family, cannot have 2.4 children. It is the fact that 
fractionalization is necessarily characterized by integers which guarantees 
that it is stable if it occurs. 7 

The basic idea can be understood by considering the Aharonov-Bohm ef- 
fect. Suppose an infinitely- long, thin solenoid at the origin which is threaded 
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by flux $ (in units in which h = e = c = 1, one flux quantum is $ = 2ir) is 
surrounded by an infinitely-high potential barrier. As result, electrons are 
prevented from entering the solenoid, and they move in a region in which 
the magnetic field is zero. As Aharonov and Bohm showed, the cross-section 
for an electron of momentum p to scatter off the flux tube is: 

da 1 . 2 $ 

M = 2npsm\e/2) Sm 2 ( } 

In other words, the scattering cross-section is non-trivial and depends on 
<& even though the electron never enters the region in which B / (the 
interior of the solenoid) . 

Any description of the physics of this system in terms of the electric and 
magnetic fields E, B alone cannot be local. It must involve some kind of 
action-at-a-distance so that the magnetic field inside the solenoid can affect 
the electron outside the solenoid. However, a local description can be given 
in terms of the vector potential, 

by simply including this vector potential in the Hamiltonian, 

^ = ^(P-A) 2 V (19.3) 

The electromagnetic potential is an example of a gauge field. By this, 
we mean that the vector potential, A^, is not itself a measurable quantity 
because all physically measurable quantities are invariant under the gauge 
transformation: 

A^(x) -> A^x) - d^xix) 
Y>(x) e ix( - x) il>(x) (19.4) 

The gauge field A^(x) is a redundant way of parametrizing B, E which 
satisfy V • B = 0, V • E = 4irp. This redundancy is the price which must be 
paid in order to retain a local description. 

In particular, when A^{x) = d^f for some /, the electromagnetic poten- 
tial is equivalent under a gauge transformation to zero. However, V x A = 
does not always mean that an f(x) exists such that A = V/. The potential 
(19.2) is an example of such a topologically non-trivial vector field. It is 
locally equivalent to zero, but not globally, as a result of the singularity at 
the origin. 
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If we were to try to gauge away the vector potential (19.2) by taking the 
singular function 

/ = — tan- 1 ^ = — 9 (19.5) 

2vr x 2n ' j 

the wavefunction would no longer be single- valued: 

tfj{r,6) ^e^ e/2 "VM) (19.6) 

This is because, as the electron encircles the origin, it aquires a gauge- 
invariant 'Aharonov-Bohm phase' 

e i§A-d\ = e i<£ ( 197 ) 



which distinguishes the vector potential from a trivial one. However, as the 
above example shows, we can work with a vanishing vector potential at the 
cost of having a multi-valued wavefunction. 

The phase aquired by the electron is independent of how close the elec- 
tron comes to the solenoid or how fast it moves and depends only on the 
topology of the electron's path, namely how many times it winds about the 
origin. Hence, the gauge field (19.2) gives rise to a 'topological interac- 
tion', which is felt by the electron even if it is infinitely far away from the 
solenoid. As we discuss below, it is customary in certain circumstances to 
separate such topological interactions from ordinary ones which do depend 
on distance and lump them into particle 'statistics'. 

As we will see, the low-energy excitations of a strongly-interacting elec- 
tron system can aquire similar phases - i.e. have non-trivial braiding proper- 
ties - when they encircle each other. These phases result from the electron- 
electron correlations which are encoded in the ground-state wavefunction. A 
local description of the physics of these excitations must incorporate gauge 
fields for the reason which we saw above. Unlike the electromagnetic field, 
these gauge fields will be a dynamically generated feature of the low-energy 
properties of the system. Such a system can be fully gapped, in which case 
the non-trivial braiding properties of the excitations come into play at the 
finite energies at which these excitations are created. However, even at 
low-energies, these braiding properties are manifested in the ground state 
on manifolds of non-trivial topology. The ground state is degenerate, a re- 
flection of the braiding properties of the quasiparticles. The effective field 
theories of these ground states and of the ground states with a fixed number 
of quasiparticles are called topological quantum field theories. 
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Figure 19.1: Different trajectories of hard-core particles in 2 + 1 dimensions 
which are not adiabatically deformable into each other. 

19.2 Exotic Braiding Statistics 

Let us consider the braiding properties of particle trajectories in 2 + 1- 
dimensions (2 spatial and 1 time dimension). According to Feynman, the 
quantum-mechanical amplitude for hard-core particles which are at x±, X2, ■ ■ ■ , x. 
at an initial time to to return to these coordinates at a later time t is given 
by a sum over all trajectories. Each trajectory is summed with weight e tS . 
This particular assignment of weights gives consistency with the classical 
limit. However, a peculiarity of two spatial dimensions is that the space of 
trajectories of hard-core particles is disconnected, as may be seen in figure 
(19.1). 

Consequently, at the quantum mechanical level, we have the freedom, 
as Leinaas and Myrrheim, Wilczek, . . . observed, to weight each of these 
different components by a different phase factor. Since these components are 
not continuously deformable into each other, the stationary phase condition 
associated with the classical limit does not constrain these phases. 

These phase factors realize an Abelian representation of the braid group, 
whose elements are the different components of trajectory space with a com- 
position operation obtained by simply following one trajectory by another. 
Let us consider the case of two identical particles. The braid group is sim- 
ply the group of integers, with integer n corresponding to the number of 
times that one particle winds counter-clockwise about the other (negative 
integers are clockwise windings). If the particles are identical, then we must 
allow exchanges as well, which we can label by half-integer windings. The 
different representations of the braid group of two identical particles are la- 
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Figure 19.2: An assignment of phases to different disonnected components 
of the space of trajectories of two particles. 



belled by a phase a, so that a trajectory in which one particle is exchanged 
counter-clockwise with the other n times receives the phase e ma . 

If a = 0, the particles are bosons; if a = tt, the particles are fermions. For 
intermediate values of a, the particles are called anyons. The braid group of 
N particles has more complicated representations which can be non-abelian, 
but a class of its representations is just an extension of the two-particle case: 
whenever any of N identical particles is exchanged counter-clockwise n times 
with another, the phase associated with this is e ma . 

In a slight abuse of terminology, we use the term 'statistics' to describe 
these representations of the braid group. In reality, it is more like a topolog- 
ical interaction since it is not limited to identical particles. Different particle 
species can have 'mutual statistics' when they wind about each other (since 
they are not identical, they cannot be exchanged). This is quite different 
from the case in higher dimensions, where there is no braid group, and we 
only have the permutation group - which acts only on identical particles - 
whose only abelian representations are bosonic and fermionic. To emphasize 
the distinction between this notion of statistics and the usual one, we will 
use the term 'braiding statistics'. 

As we will see in the next chapter, this expanded notion of statistics 
is more than a mathematical curiosity; it is realized in all of its glory in 
the quantum Hall effect. First, however, we will discuss its field-theoretical 
implementation. 
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19.3 Chern-Simons Theory 

Non-trivial braiding statistics can be implemented by taking wavefunctions 
which are multi-valued so that a phase is aquired whenever one particle 
is exchanged with another. However, as we saw at the beginning of this 
chapter, we can make these wavefunctions single-valued by introducing a 
gauge field a (distinct from the electromagnetic field A) which gives rise to 
a vanishing magnetic field but is not gauge-equivalent to zero, in the spirit 
of (19.2). 

/ \ & V^Z X (x — Xj) , 

•tt=sin^r (19 - 8) 

where Xj is the position of the i th particle. When one particle winds around 
another, it aquires a phase. An exchange is half of a wind, so half of this 
phase is aquired during an exchange. 

Such a gauge field is produced automatically if we add a Chern-Simons 
term to the action. Consider the addition of such a term to the action for a 
system of free fermions: 

S = j ^t( i $_ ao )^ + _L^t( iV -a)V +^ J ' dtcPxe^a^a,, 

(19.9) 

The action (19.9) is invariant under the gauge transformation 

a M (x) -> a^(x) - d^x{x) 

V>(x) ->■ e ix{x) ^(x) (19.10) 

up to the boundary term 

6S = ^f # x xtiA a i ( 19 - n ) 

Z9 JdR 

In an infinite system or on a compact manifold, we can ignore this boundary 
term. When we consider a bounded region R of the plane, this term will be 
important, as we will discuss in the context of the quantum Hall effect. 

Since no time derivative of ao appears in the Lagrange multiplier, it is a 
Lagrange multiplier. If we vary it, we obtain the constraint: 

Vxa = $^ (19.12) 

This constraint completely fixes a^, up to gauge transformations. Hence, 
the gauge field has no independent dynamics of its own; it is completely 
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determined by ip(x). According to the constraint (19.12) a flux <& is attached 
to each fermion. 

Let us consider the Chern-Simons action in the gauge ao = 0. The action 

is 

S = J V' t (idt - ao) ip + ^ ift{iV - a) 2 tp ( a i^ofl2 - a 2 d ai) 

(19.13) 

Thus, the Hamiltonian of the Chern-Simons gauge field vanishes. Note, 
however, that the Hamiltonian must be supplemented by the constraint 
(19.12). 

Hence, the Chern-Simons term does what we want - i.e. implement 
anyonic braiding statistics - and it does nothing else. 

19.4 Ground States on Higher-Genus Manifolds 

Let us now imagine that the particles are all gapped, so that we can integrate 
them out. Let us further assume that the Chern-Simons coefficient is an 
integer m divided by Air. We will return to this assumption below. Then, 
the effective action at low energies is simply 

S =^J ^ P ^d u a p (19.14) 

This theory would appear to be completely trivial. The gauge field is 
fixed by the constraint 

Vxa = (19.15) 

and the Hamiltonian vanishes. Thus, the effective action only describes the 
ground state - or states. 

On the infinite plane or the sphere, the ground state is a unique, non- 
degenerate state. Pure Chern-Simons theory (i.e. without any other fields 
to it) has no other states. However, suppose that the theory is defined on 
the torus. Then V x a = can still give rise to non-trivial 

if 7 winds around one of the non-trivial cycles of the torus. According to 
the constraint, Ay does not depend on the precise curve 7 but only on how 
many times it winds around the generators of the torus. Furthermore, it is 
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clear that A^ is additive in the sense that its value for a curve 7 which winds 
twice around one of the generators of the torus is twice its value for a curve 
7 which winds once. Hence, we have only two independent variables, A±, 
A2 associated with the two generators of the torus. If we take coordinates 
81,62 € [0, 2ir] on the torus, then 

A i= / aidBi (19.17) 
J 

From (19.13), we have the following equal-time commutation relations: 

\a 1 (x),a 2 (x')] =i— 8^(x-x') (19.18) 
m 

from which it follows that 

2-7T? 

[^i,A 2 ] = — (19.19) 
m 

Since A\, A2 are not themselves gauge-invariant, we cannot simply use 
the analogy between their commutation relations and those of p, x for a 
single particle. We must work with the gauge invariant quantities e iA \ 
They have more complicated comutation relations. Since 

e iA, Qi A 2 = e [A u A 2 ]/2 e iA 1+ iA 2 ^ Q 2Q) 

we have the commutation relation 

e *Ai e iA 2 = e 2ni/m & iA 2 g iAi (IQ.21) 
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Figure 19.4: Creating a quasiparticle-quasihole pair, taking them around 
either of the generators of the torus and annihilating them leads yields 
two non-commuting operations which encode quasiparticle statistics in the 
ground state degeneracy. 

This algebra can be implemented on a space of minimum dimension m: 

e iA ^ \n) = e 2 ™ /m \n) 

e iM \n) = \n + l) (19.22) 

i.e. the ground state is m-fold degenerate. On a genus g manifold, this 
generalizes to m 9 . 

This has an interpretation in terms of the quasiparticle spectrum of 
the theory - about which we thought that we had lost all information by 
going to low energies below the quasiparticle energy gap. Imagine creating a 
quasihole-quasiparticle pair, taking them around one of the two non-trivial 
loops on the torus and annihilating them. Call the corresponding operators 
Xi, T 2 . If the quasiparticles have statistics ir/m, then 

TxT 2 = e 2ni/m T 2 Ti (19.23) 

because the particles wind around each other during such a process, as 
depicted on the right of figure 19.4. This is precisely the same algebra 
(19.21) which we found above, with a minimal representation of dimension 
m. 

Hence, if we know that the ground state degeneracy of a system on a 
genus- g manifold is m 9 , then one explanation of this degeneracy is that it 
has non-trivial quasiparticles of statistics 0, n/m, . . . , (m — l)7r/m. 

Why did we take the Chern-Simons coefficient to be an integer? This 
is required when we define Chern-Simons theory on compact manifolds or, 
equivalently, when we require invariance under large gauge transformations. 
On a compact manifold, the Chern-Simons action transforms under a gauge 
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transformation defined by a function x( x ) as: 

^ J e^a^a p - £ y e^%cW p + 2™iV (19.24) 

where iV is the winding number of the map from x to e* x ^ £ f/(l). Hence, 
invariance of the functional integral mandates that we take m to be an 
integer. 



CHAPTER 20 



Introduction to the Quantum Hall Effect 



20.1 Introduction 

In 1879, E.H. Hall performed an experiment designed to determine the sign 
of the current-carrying particles in metals. If we suppose that these particles 
have charge e (with a sign to be determined) and mass m, the classical 
equations of motion of charged particles in an electric field, E = E x ~k + E y y , 
and a magnetic field, B = Bz are: 

dpx ^ , 
— = eE x - uj c py - p x /r 

d ^ = eE y +Lo cPx -p y /T (20.1) 

where uj c = eB/m and r is a relaxation rate determined by collisions with 
impurities, other electrons, etc. Let us, following Hall, place a wire along the 
x direction in the above magnetic fields and run a current, j x , through it. 
In the steady state, dp x /dt = dp y /dt = j y = 0, we must have E x = j x 
and 

B -eh $/<I>o 
E y = j x = — --L-° Jx 20.2 

ne |e| e z JS 

where n and N are the density and number of electrons in the wire, $ is the 
magnetic flux penetrating the wire, and $o = is the flux quantum. Hence, 
the sign of the charge carriers can be determined from a measurement of 
the transverse voltage in a magnetic field. Furthermore, according to (20.2), 
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Figure 20.1: p xx and p xy vs. magnetic field, B, in the quantum Hall regime. 
A number of integer and fractional plateaus can be clearly seen. This data 
was taken at Princeton on a GaAs-AlGaAs heterostructure. 

the density of charge carriers - i.e. electrons - can be determined from the 
slope of the p xy = E y /j x vs B. At high temperatures, this is roughly what 
is observed. 

In the quantum Hall regime, namely at low-temperatures and high mag- 
netic fields, very different behavior is found in two-dimensional electron sys- 
tems. p xy passes through a series of plateaus, p xy = £ \, where v is a 
rational number, at which p xx vanishes [?, ?], as may be seen in Figure 20.1 
(taken from [?]). The quantization is accurate to a few parts in 10 8 , making 
this one of the most precise measurements of the fine structure constant, 

2 

Oi = j-, and, in fact, one of the highest precision experiments of any kind. 

Some insight into this phenomenon can be gained by considering the 
quantum mechanics of a single electron in a magnetic field. Let us sup- 
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pose that the electron's motion is planar and that the magnetic field is 
perpendicular to the plane. For now, we will assume that the electron is 
spin-polarized by the magnetic field and ignore the spin degree of freedom. 
The Hamiltonian, 

H = — (-i HV + - A) 2 (20.3) 
2m V c / 

takes the form of a harmonic oscillator Hamiltonian in the gauge A x = —By, 
A y = 0. (Here, and in what follows, I will take e = |e|; the charge of the 
electron is — e.) If we write the wavefunction i/j(x,y) = e lkx 4>(y), then: 



Hip 



<t>(v) e 



ikx 



(20.4) 



The energy levels E n = (n + ^)huj c , called Landau levels, are highly degener- 
ate because the energy is independent of k. To analyze this degeneracy (and 
in most of what follows) it will be more convenient to work in symmetric 
gauge, A = x r Writing z = x + iy, we have: 



m 



&+ All) 



(20.5) 



with (unnormalized) energy eigenfunctions: 



>l 2 



TP n , m (z,z) = z m L™(z,z)e K 



(20.6) 



at energies E n = (n + ^)fou> c , where L™(z, z) are the Laguerre polynomials 
and £q = a/ K/ (eB) is the magnetic length. 

Let's concentrate on the lowest Landau level, n = 0. The wavefunctions 
in the lowest Landau level, 



^n=^m{z,Z) = z m e ^0 



(20.7) 



are analytic functions of z multiplied by a Gaussian factor. The general 
lowest Landau level wavefunction can be written: 



^n=o,m(*,*) = /(*)e 4< o (20.8) 

The state ip n =o,m is concentrated on a narrow ring about the origin at radius 
T m = lo\/2(m + 1). Suppose the electron is confined to a disc in the plane 
of area A. Then the highest m for which ip n=0im lies within the disc is 
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given by A = Trr mmax , or, simply, m max + 1 = $/$ , where $ = BA is 
the total flux. Hence, we see that in the thermodynamic limit, there are 
/<&o degenerate single-electron states in the lowest Landau level of a two- 
dimensional electron system penetrated by a uniform magnetic flux <J>. The 
higher Landau levels have the same degeneracy Higher Landau levels can, 
at a qualitative level, be thought of as copies of the lowest Landau level. The 
detailed structure of states in higher Landau levels is different, however. 

Let us now imagine that we have not one, but many, electrons and 
let us ignore the interactions between these electrons. To completely fill p 
Landau levels, we need N e = p(&/&o) electrons. Inverting the semi-classical 
resistivity matrix, and substituting this electron number, we find: 

e 2 N e e 2 , , 

^=Twr^ p (20 - 9) 

for p filled Landau levels, where N$ = 3>/<i>o- 

Suppose that we fix the chemical potential, jjl. As the magnetic field is 
varied, the energies of the Landau levels will shift relative to the chemical 
potential. However, so long as the chemical potential lies between two Lan- 
dau levels (see figure 20.2), an integer number of Landau levels will be filled, 
and we expect to find the quantized Hall conductance, (20.9). 

These simple considerations neglected two factors which are crucial to 
the observation of the quantum Hall effect, namely the effects of impurities 
and inter-electron interactions. 8 The integer quantum Hall effect occurs in 
the regime in which impurities dominate; in the fractional quantum Hall 
effect, interactions dominate. 9 



20.2 The Integer Quantum Hall Effect 

Let us model the effects of impurities by a random potential in which non- 
interacting electrons move. Clearly, such a potential will break the degen- 
eracy of the different states in a Landau level. More worrisome, still, is the 
possibility that some of the states might be localized by the random poten- 
tial and therefore unable to carry any current at all. The possible effects of 
impurities are summarized in the hypothetical density of states depicted in 
Figure 20.2. 

Hence, we would be led to naively expect that the Hall conductance is 
less than ^- p when p Landau levels are filled. In fact, this conclusion, though 
intuitive, is completely wrong. In a very instructive calculation (at least from 
a pedagogical standpoint), Prange [?] analyzed the exactly solvable model 
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Figure 20.2: (a) The density of states in a pure system. So long as the 
chemical potential lies between Landau levels, a quantized conductance is 
observed, (b) Hypothetical density of states in a system with impurities. 
The Landau levels are broadened into bands and some of the states are 
localized. The shaded regions denote extended states, (c) As we mention 
later, numerical studies indicate that the extended state (s) occur only at the 
center of the band. 
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impurity region 



Figure 20.3: (a) The Corbino annular geometry, (b) Hypothetical distribu- 
tion of energy levels as a function of radial distance. 

of electrons in the lowest Landau level interacting with a single J-function 
impurity. In this case, a single localized state, which carries no current, is 
formed. The current carried by each of the extended states is increased so as 
to exactly compensate for the localized state, and the conductance remains 
at the quantized value, a xy = This calculation gives an important hint 
of the robustness of the quantization, but cannot be easily generalized to 
the physically relevant situation in which there is a random distribution of 
impurities. To understand the quantization of the Hall conductance in this 
more general setting, we will turn to the beautiful arguments of Laughlin 
(and their refinement by Halperin [?]), which relate it to gauge invariance. 

Let us consider a two-dimensional electron gas confined to an annulus 
such that all of the impurities are confined to a smaller annulus, as shown 
in Figure 20.3. Since, as an experimental fact, the quantum Hall effect is 
independent of the shape of the sample, we can choose any geometry that 
we like. This one, the Corbino geometry, is particularly convenient. States 
at radius r will have energies similar to to those depicted in Figure 20.3. 

Outside the impurity region, there will simply be a Landau level, with 
energies that are pushed up at the edges of the sample by the walls (or a 
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smooth confining potential). In the impurity region, the Landau level will 
broaden into a band. Let us suppose that the chemical potential, [/,, is above 
the lowest Landau level, fi > Hlo c /2. Then the only states at the chemical 
potential are at the inner and outer edges of the annulus and, possibly, in 
the impurity region. Let us further assume that the states at the chemical 
potential in the impurity region - if there are any - are all localized. 

Now, let us slowly thread a time-dependent flux <J>(i) through the center 
of the annulus. Locally, the associated vector potential is pure gauge. Hence, 
localized states, which do not wind around the annulus, are completely 
unaffected by the flux. Only extended states can be affected by the flux. 

When an integer number of flux quanta thread the annulus, <3?(i) = p$0) 
the flux can be gauged away everywhere in the annulus. As a result, the 
Hamiltonian in the annulus is gauge equivalent to the zero-flux Hamiltonian. 
Then, according to the adiabatic theorem, the system will be in some eigen- 
state of the &(t) = Hamiltonian. In other words, the single-electron states 
will be unchanged. The only possible difference will be in the occupancies 
of the extended states near the chemical potential. Localized states are un- 
affected by the flux; states far from the chemical potential will be unable to 
make transitions to unoccupied states because the excitation energies asso- 
ciated with a slowly-varying flux will be too small. Hence, the only states 
that will be affected are the gap less states at the inner and outer edges. 
Since, by construction, these states are unaffected by impurities, we know 
how they are affected by the flux: each flux quantum removes an electron 
from the inner edge and adds an electron to the outer edge. Then, J I dt = e 



Clearly, the key assumption is that there are no extended states at the 
chemical potential in the impurity region. If there were - and there probably 
are in samples that are too dirty to exhibit the quantum Hall effect - then 
the above arguments break down. Numerical studies [?] indicate that, so 
long as the strength of the impurity potential is small compared to ftu c , 
extended states exist only at the center of the Landau band (see Figure 
20.2). Hence, if the chemical potential is above the center of the band, the 
conditions of our discussion are satisfied. 

The other crucial assumption, emphasized by Halperin [?], is that there 
are gapless states at the edges of the system. In the special setup which 
we assumed, this was guaranteed because there were no impurities at the 



and JVdt = f!§ = h/e, so: 




V 



(20.10) 
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edges. In the integer quantum Hall effect, these gapless states are a one- 
dimensional chiral Fermi liquid. Impurities are not expected to affect this 
because there can be no backscattering in a totally chiral system. More 
general arguments, which we will mention in the context of the fractional 
quantum Hall effect, relate the existence of gapless edge excitations to gauge 
invar iance. 

One might, at first, be left with the uneasy feeling that these gauge in- 
variance arguments are somehow too 'slick.' To allay these worries, consider 
the annulus with a wedge cut out, which is topologically equivalent to a 
rectangle (see the article by D.J. Thouless in the first reference in [?]). In 
such a case, some of the Hall current will be carried by the edge states at 
the two cuts (i.e. the edges which run radially at fixed azimuthal angle). 
However, probes which measure the Hall voltage between the two cuts will 
effectively couple these two edges leading, once again, to annular topology. 

Laughlin's argument for exact quantization will apply to the fractional 
quantum Hall effect if we can show that the clean system has a gap. Then, 
we can argue that for an annular setup similar to the above there are no 
extended states at the chemical potential except at the edge. Then, if thread- 
ing q flux quanta removes p electrons from the inner edge and adds p to the 
outer edge, as we would expect at v = p/q, we would have a xy = | 

20.3 The Fractional Quantum Hall Effect: The 
Laughlin States 

A partially filled Landau level of non-interacting electrons has a highly de- 
generate ground state in the absence of impurities. This degeneracy is bro- 
ken by fairly generic interactions, including Coulomb repulsion. As we will 
see below, at special filling fractions, there is a non-zero gap between the 
ground state and the lowest excited state. In very clean samples, the im- 
purity potential will be a weak perturbation which pins the quasiparticles 
but does not drastically affect the physics of the ground state. If the sample 
is too dirty, however, the fractional quantum Hall effect will be destroyed. 
In what follows, we will try to understand the physics of a partially filled 
Landau level of interacting electrons in a clean system. We will further as- 
sume that we can ignore all higher Landau levels. This assumption will be 
valid in the limit that the cyclotron energy is much larger than the Coulomb 
interaction energy, Hlu c 3> j^. In a sample with density 1.25 x 10 _11 cm -2 , 

the v = ^ state is seen at 15T, while hio c = at 6T. Hence, higher Lan- 
dau levels are probably unimportant qualitatively, but could lead to some 
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quantitative corrections. 

Let us, following Haldane [?], consider the special interactions for which 
the Laughlin states are the exact ground states. To do this, let us first look 
at the two-electron problem in the lowest Landau level. We separate the 
center-of-mass and relative motions, ip = P(Z, z ) ( we wu l be sloppy and 
drop the Gaussian factors because they are unimportant for this analysis), 
where Z = z\ + z 2 , z = z\ — z 2 , and z\ and Z2 are the coordinates of the two 
electrons. The Hamiltonian has no kinetic part in the lowest Landau level. 
Dropping the constant hco c , it is given simply by the interaction, V, which 
depends only on the relative motion 

H-v(z,±) (20.11) 

Let us now switch to a basis of relative (canonical) angular momentum 
eigenstates, L z \m) = m\m), which are given in position space by (z\m) = 
z m . Then, we can write: 

H=Y, VmPrn (20.12) 
modd 

The restriction to odd m is due to Fermi statistics. V m = (m\V\m); (m\V\m') 
vanishes for m 7^ m' if V is rotationally invariant. P m is the projection oper- 
ator onto states of relative angular momentum m. Suppose we take V m > 
for m < k and V m = for m > k. Then the states ip(z) = z m are pushed 
up to energies E m = V rn for m < k but the states tp(z) = z m , m > k remain 
degenerate at E = 0. 

The Hamiltonian for the iV-electron problem is just: 

i>j modd 

where Pm projects the i — j pair onto a state of relative angular momentum 
m. Let us consider the simple, but unrealistic interaction V\ / 0, V m = for 
m > 1. Any wavefunction in the lowest Landau level, ip = P(z\, &2, ■ ■ ■ , zjy) 
can be written: 

Y> = ^(Zi- Zj) m F m (zi + Zj - z k , k ± (20.14) 

m odd 

If we take F m = for m = 1, then Hip = 0. In this case, (zj — zj) 3 is a 
factor of ip for all i / j. Hence, the following wavefunctions all have zero 
energy 

iP = Y[( Zi - Zj f S( Zl ,z 2 , ...,z N ) (20.15) 

i>j 
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where S(zi, Z2, ■ ■ ■ , zm) is a symmetric polynomial. These states describe 
droplets of electrons. Of these wavefunctions, the Laughlin wavefunction 
[% 

v>3=n>-^) 3 ( 2 °- 16 ) 

i>j 

is the most spatially compact droplet. In a confining potential, this will be 
the ground state. The other symmetric polynomials correspond to quasiholes 
and edge excitations. Had we chosen V m > for m < 2k + 1 and V m = for 
m > 2k + 1, we would have found the wavefunction (20.16) with the power 
3 replaced by 2 k + 1: 

^ + i=n^-^') 2/£+1 (20.17) 

i>j 

The maximum power of any z\ in the Laughlin state is 3(N — 1). Since 
the single-electron state with canonical angular momentum m encloses area 
27t^q(to-|-1), the Laughlin state of N electrons occupies area A = 2tt£q(3(N — 
1) + 1) = 2irh/(eB)(3(N - 1) + 1). The total flux piercing this area is 
$ = BA = $o(3(A r — 1) + 1). Hence, the filling fraction, v is 



TV N 1 



$/$o 3(iV-l) + l 3 



(20.18) 



in the thermodynamic limit. 

To compress this state, that is, to get v < 1/3, at least one pair of 
particles will have relative angular momentum m = 1, which costs a finite 
amount of energy. A more precise and general way of stating this result 
involves calculating the compressibility, n 

A fdN\ 

at fixed angular momentum L {A is the area of the system). For our choice 
of interaction, E (N) = E (N — 1) = but E (N + 1) > for fixed total 
angular momentum 3N(N — 1). Hence, ^ = Eq{N) — Eq(N — 1) = while 
/i^r = Eq(N + 1) — Eq(N) 0. The discontinuity in the chemical potential 
implies incompressibility according to (20.19). For more realistic potentials, 
it may no longer be true that ^ = 0, but the discontinuity will persist. 

The Laughlin wavefunction (20.17) was initially proposed as a trial varia- 
tional wavefunction for electrons interacting with Coulomb interactions. For 
small numbers of electrons, it has remarkably large overlap with the exact 
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ground state (see, for instance, the article by F.D.M. Haldane in the first 
reference in [?]). At filling fraction v = l/(2k + 1), the wavefunction must 
be a homogeneous polynomial of degree (2k + 1)N(N — 1) /2. In other words, 
if we fix the coordinates z±, Z2, ■ ■ ■ , zjv-i of N — 1 of the electrons, then the 
wavefunction, considered as a function of the remaining electron, zn, will 
have (2k + 1) zeroes for each of the N — 1 electrons. Since the electrons 
are fermions, there must be at least one zero at the positions of the other 
electrons. A state at v = 1/ (2k + 1) is specified by the positions of the other 
zeroes. In the Laughlin state, there is a 2k + 1-fold zero at the positions 
of the other electrons; all of the zeroes are at the electron locations. In the 
exact ground state of electrons with some other kind of interaction, say the 
Coulomb interaction, there are still (2k + 1) zeroes bound to each electron, 
but they are slightly displaced from the electron. The quantum Hall effect 
breaks down precisely when the zeroes dissociate from the electrons. 

A particularly useful technique for obtaining many properties of the 
Laughlin states is the plasma analogy (see, for instance, the article by R.B. 
Laughlin in the first reference in [?]). Since \ip\ 2 is of the form of the Boltz- 
mann weight for a classical finite-temperature plasma of charge 2k + 1 par- 
ticles in a neutralizing background, 

|^| 2 = e 2FKr( 2 ( 2fe + 1 ) 2 Elnk l -^|-(2fe+l)EI^| 2 /<) = g-ZWplasma (20.20) 

the expectation value of many operators in the ground state is just given by 
the corresponding expectation values in the plasma. For instance, since the 
temperature T = 2k + 1 is above the melting temperature for the plasma, 
we can conclude that the correlation functions of the density do not exhibit 
long-range positional order. 10 Combining this result with our earlier dis- 
cussion of the compressibility, we can say that the Laughlin states describe 
incompressible quantum liquids. 

To establish the quantum Hall effect in these states, we need to un- 
derstand the excitation spectrum. In particular, we must show that there 
is a finite energy gap separating the ground state from excited states. If 
we imagine adiabatically inserting a flux tube at the origin in a Laughlin 
state at v = l/(2k + 1), then, by arguments very similar to those used in 
the annulus geometry, we expect charge e/(2k + 1) to be transported from 
the insertion point to the outer edge of the system. The flux tube can be 
gauged away, leaving an eigenstate of the original Hamiltonian with a deficit 
of l/(2k + 1) of an electron at the origin [?]. n Such an excitation is called 
a 'quasihole.' If the inserted flux were oppositely directed, an excitation 
with an excess charge of —e/(2k + 1) at the origin would be created, or a 
'quasiparticle.' 
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Laughlin suggested the following quasihole state, 



^2k+l = ~[[( Z i -V)^2k+1 



(20.21) 



which is an exact zero-energy eigenstate of the Hamiltonian (20.13) and 
has a large overlap with the exact quasihole state of a system with a small 
number of electrons interacting through the Coulomb interactions. In this 
state, the angular momentum of each electron is increased by one and the 
net flux penetrating the electron droplet is increased by one flux quantum. 
The state: 



looks like the ground state of N + 1 electrons, but with a deficit of one 
electron at the position rj. Hence, the state (20.21) has charge e/(2k + 1) at 

V- 

A quasiparticle wavefunction which is an exact eigenstate of the Hamil- 
tonian (20.13) has not been found. The trial wavefunction: 



has reasonably good overlap with the exact quasihole state in systems with a 
small number of electrons. The quasiparticle has fractional charge — e/(2k + 
1). As a general rule, exact quasiparticle eigenstates are more difficult to 
come by than quasihole states, so we will primarily discuss quasiholes. Most 
of the properties of quasiparticles can be inferred from those of quasiholes. 

At v = l/(2k + 1), the gap between the ground state and a state with 
a widely-separated quasihole-quasiparticle pair is just (/Li + — n~)/(2k + 1). 
This follows from the definition of ^ and the fact that a widely separated 
pair will have no interaction energy. A = (fi + — fi~)/(2k + 1) is the gap 
which is measured in transport experiments - for instance from p xx ~ e A / T 
- since a widely separated pair must be created to carry a longitudinal 
current. However, this is not the smallest gap in the system. A quasihole- 
quasiparticle pair at finite separation will have lower energy as a result of 
the Coulomb interaction between them. Suppose the distance between the 
quasihole and quasiparticle is parametrized by k so that the distance is 
ki\ > £ . Then, we can think of the quasihole and quasiparticle - which 
have core sizes on the order of a few magnetic lengths - as point charges; 

2 

the energy of the pair will be E(k) = A — [?]. The pair will move in a 
straight line with velocity v/, = perpendicular to the line connecting 



(20.22) 




(20.23) 
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them since the Coulomb force between them will exactly balance the Lorentz 
force due to the magnetic field. At low k, the quasihole-quasiparticle pair 
evolves continuously into a collective mode, the magneto-roton [?]. The 
name magneto-roton stems from the fact that this collective excitation is 
obtained in the single-mode approximation just as the roton was in Feyn- 
man's analysis of superfluid 4 He. As we will see later, the analogy between 
the quantum Hall effect and superfluidity can be further exploited. 
To summarize, the Laughlin state has the following properties: 

• It is a wavefunction describing electrons in a strong magnetic field. 
The electrons are assumed to be in the lowest Landau level. 

• It is the non-degenerate ground state of a model repulsive Hamiltonian 
(20.13). 

• It is an excellent approximation to the ground state of electrons in a 
magnetic field interacting through the Coulomb potential. 

• The state is incompressible. 

• The state does not break translational symmetry, i.e. it is a liquid. 

In order to observe a fractional quantum Hall plateau with a xy = x , 
(j M = 0, we also need a small amount of impurities as well, in order to pin 
any quasiparticles which are produced by small changes of the magnetic field 
or electron density. However, we don't want too much disorder since this 
might simply pin the electrons and prevent them from forming a correlated 
state (20.16). 



20.4 Fractional Charge and Statistics of Quasipar- 
ticles 

Let us return to a discussion of the quantum numbers of the quasiholes and 
quasiparticles. We found earlier that these excitations carry fractional elec- 
tric charge. This is remarkable, but has a precedent in polyacetylene; the 
statistics, to which we now turn, is perhaps even more exotic. If we sup- 
pose that the phase acquired by the wavefunction when one quasihole moves 
around another is e 1 ^, then the phase for taking one electron around another 
is e*( 2fc+1 ) ^, and the phase associated with taking an electron around a 
quasihole is e *( 2fc + 1 )<^ ; since m quasiholes is equal to a deficit of one elec- 
tron. From the wavefunction (20.21), we see that e i{2k+l ^ = e 2ni and 
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e i(2k+l) 2 4> = e 2n(2k+l)i_ Thig would j ead ug tQ CQnclude that e i4> = e 2ni/(2k+l)^ 

Similar arguments would lead us to conclude that quasiparticles have the 
same statistics. 12 

These heuristic arguments for the charge and statistics of the quasiholes 
and quasiparticles are inadequate even though they give the correct answers. 
Fortunately, these quantum numbers can be determined directly. Following 
Arovas, Schrieffer, and Wilczek [?], we will calculate the Berry's phase [?] 
acquired when quasiholes are moved around loops. Recall that the adiabatic 
theorem deals with a family of Hamiltonians, H({Aj}), parameterized by 
{Ai, A2, . . . , Afc}, with non-degenerate eigenstates |n({Aj})): 

H({A,}) |n({Ai})> = E n ({Xi})\n({Xi})) (20.24) 

Suppose we vary the Aj's slowly with time 13 , A, = Aj(i), such that H(Aj(0)) = 
H(Aj(T)); then |n({A;(0)})) = M |n({Aj(T)})), where M is a phase. Of- 
ten, we require M = 1, but this is unnecessary. A state \ip{t)) satisfying 
|^(0)) = |n({Aj(0)})) will evolve subject to Schrodinger's equation, 

mumm)) = j t \m) (20.25) 

so that 

|V>(T)) =Me i ^eiti E ^ dt \i){Q)) (20.26) 
Berry's phase, j n , is given, according to Schrodinger's equation (20.25), by 

In = i J (n({\m)\j t H{Xi(t)})) (20.27) 

The integral (20.27) is reparameterization invariant, so Berry's phase de- 
pends only on the path in parameter space; in particular, 7„ remains finite in 
the adiabatic limit, unlike the dynamical phase, E(t) dt. One other point 
worth mentioning is that Berry's phase (20.27) only depends on the Hamil- 
tonian implicitly. In what follows, we will be interested in the Berry's phase 
acquired by quasihole wavefunctions as the quasiholes are moved around. 
We will implicitly assume that there is some Hamiltonian with a pinning 
potential, say, for which the state with a quasihole at r] is a non-degenerate 
eigenstate. As the location of the pinning potential is moved, this eigenstate 
evolves, and a Berry's phase will accumulate, but we need not be concerned 
with the details of the Hamiltonian to do this calculation. 
We consider, then, the Laughlin quasihole 



\m) = H(v(t)-zi)^2k+i 

i 



(20.28) 
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and take r](t) to move slowly around some loop as a function of t. Since 
we can rewrite 

(m\j t \m) = J d 2 z-^—m)\J2 s ( z -^\^ 

1,2 (m\p(z)\m) (20.30) 



/ 



where p(z) is the density. Then, the Berry's phase acquired in a circuit, C, 
bounding a region R of area ^4^ is: 

7n = i j> d V J d 2 z (20.31) 

p(z) = po except in the core of the quasihole. Since PqAr = N = &r/&o, 
where <3?r is the flux in the region R, 

7n = * I # z 

J J V ~ z 

= -2ir J d 2 z (p(z)) 
2ir 

($ fl /$o) (20.32) 



2k + 1 

up to corrections of order t^/Ar, where r c is the size of the quasihole. This is 
just the phase that we would expect for a particle of charge ve L = e/ (2k + 1) 
in a magnetic field. 

Suppose, now, that we had considered a multi-quasihole wavefunction. 
If the loop C had enclosed another quasihole, p(z) would no longer be given 
by Po = ^^fcTT^/^o- There would be a charge deficit at the position of 
the second quasihole. Then, we would find: 



-2tt J d 2 z(p(z)) 



Hence, there is an additional phase 27r/(2A; + l) acquired when one quasihole 
winds around another. In other words, quasiholes in the Laughlin state at 
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v = l/(2k + 1) have fractional statistics given by the statistics parameter 
a = l/(2k + 1), where bosons have a = and fermions, a = 1. The frac- 
tional charge and statistics of the quasiholes are the characteristic features 
of fractional quantum Hall states. 

In chapter 4, we will be interested in non-Abelian statistics, which can 
occur when there is a set of degenerate states, \a; {Aj}), a = 1, 2, . . . , g. In 
such a case, a state \ip(i)) satisfying \ip(0)} = |a;{Aj(0)}} evolves into: 

|y,(T)) =e i ^eiti E ^ dt M\b;{\ i {Q)}) (20.34) 

The degenerate subspaces must be equivalent at t = and t = T since 
the Hamiltonians coincide, but the states \a;{Xi(t)}) at t = and t = T 
can differ by an overall rotation; M is the matrix which implements this 
rotation. The Berry phase matrix, 7^, is given by: 

lab=i j {a-{\ t (t)}\j t W,{h{t)}) (20.35) 

20.5 Fractional Quantum Hall States on the Torus 

As we discussed in the last chapter the existence of anyonic quasiparticles in 
a system is reflected in its ground state degeneracy on higher-genus surfaces. 
By the arguments given there, we expect the Laughlin state for v = 1/m 
to be m-fold degenerate on a torus. In this section, we will construct the 
m wavefunctions on a torus which are annihilated by the Hamiltonian with 

v u ..., V m ^ ^0,V m = v m+1 = . . . = 0. 

In order to do so, we will make use of the Cauchy ^-functions, which are 
functions defined on the torus. Let us assume that z is a complex coordinate 
on the torus and that the torus is defined by z = z + 1, z = z + t, where r is 
some complex number which is called the modular parameter of the torus. 
Then the ^-functions are defined by: 

00 

0i(*|t)= Yl e"*( n+ 5) T e 27ri ( n+ ^)(. z +^) (20.36) 

n=— 00 

and 

2 (z\t) 
i9 3 (z\t) 
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#4 {z\t) = $1 (^Z + ^t\t 

(20.37) 

The following properties of #i will be useful: 

01 {z+ 1\t) = (z\t) 

X ( Z + r|r) = -e~ i7rT e" 2 ™ 2 ??! (z\t) 

(-z\t) = (z\t) (20.38) 

Armed with these functions, we can generalize the Laughlin wavefunction 

to: 

m 

y> = n - ^i r )] m eii " z ( z - z «i r ) ( 2 °- 39 ) 

i>j a=l 

At short distances, $i (zj — Zj|t) — ► z* — Zj, so this wavefunction is anni- 
hilated by the Hamiltonian which annihilates the Laughlin wavefunction 
on the plane. The only remaining requireent is that it be periodic under 
z — > z + 1, z — > z + r. These will be satisfied if 

/_ 1 \mJV e iK = f^Nt e iK = 1 

(_ e — ) mJV e itfr e 2*£ z = e «r = i (20 .40) 

There are m different choices of K,Z a . To see this, observe that the ratio 
between any two wavefunctions associated with two choices of K, Z a is a 
meromorphic function of Z on the torus with m simple poles. By a special 
case of the Riemann-Roch theorem, there are m linearly independent such 
functions. (Haldane, 1984) 



20.6 The Hierarchy of Fractional Quantum Hall 
States 

Thus far, we have only explained the existence of the quantized Hall plateaus 
at v = l/(2k + 1). From Figure (20.1), however, we can see that there are 
plateaus at several other odd-denominator fractions. These other states 
can be thought of as descending from the Laughlin states [?, ?]. Following 
Halperin, let us consider a 'primary' state at v = I /(2k + 1) with a finite 
density of quasiholes or quasiparticles. Since they are charged particles in a 
magnetic field, we might expect that the quasiholes or quasiparticles them- 
selves would be in a primary state (e.g. a Laughlin state) at certain preferred 
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quasihole densities. At what densities would we expect this? Electrons form 
Laughlin states only at v = l/(2k + 1) because these are the only filling 
fractions at which T\ [z% — Zi) x l v is an acceptable fermionic wavefunction. A 
Laughlin state of bosonic particles would form at v = \/{2k). Following 
this reasoning, a Laughlin state of quasiparticles of statistics —1/(2 A; + 1) 

would be of the form T\ {z% — Zi) 2p ~ 2k + 1 , while a quasihole state would be 
of the form T\ (zi ~ z~i) 2p+2k+1 since quasiholes have the opposite charge. 
Hence, the preferred filling fractions for quasiparticles and quasiholes are 
l/(2p— 2k+i ) an d V(2p+ 2k+i ) ' respectively. However, we should remember 
that these particles are fractionally charged as well, so their Landau levels 
will have {Q/(2k + l))/$o states rather than Hence, a 'descendent' of 

the v = l/(2k + 1) primary state which has quasiholes or quasiparticles in, 
respectively, filling fraction l/(2p± 2 k+\ ) states has electron filling fraction: 

/ i \ 

2fc+l 



v = _2k+l 



= t (20.41) 

(2* + l)±i 

If we now imagine the quasiholes or quasiparticles of this state forming a 
Laughlin state, and so on, we will get the continued fractions: 

v = (20.42) 



2k + l± 



1 

2 Pl ± 

2 P2 ± ■■. 

Every odd-denominator fraction less than 1 can be obtained in this way. 
Of course, fractional quantum Hall states are not observed at all of these 
fractions. As we descend through this hierarchy of states, the energy gaps 
become smaller and hence more easily destroyed by impurities. Furthermore, 
even in a pure system, the quasiholes or quasiparticles could form Wigner 
crystal states at some filling fractions rather than quantum Hall states. 



20.7 Flux Exchange and 'Composite Fermions' 

Another perspective on the hierarchy of fractional quantum Hall states in- 
volves mapping a fractional quantum Hall state to an integer quantum Hall 
state. This can be accomplished by introducing an auxiliary Chern-Simons 
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gauge field which attaches an even number of flux tubes to each electron. 
The attachment of an even number of flux tubes has no physical effect since 
it will change the phase acquired under braiding or exchange by a multiple 
of 27r. However, approximations that would have seemed unnatural without 
the auxiliary gauge field appear quite sensible in its presence. 

Let us consider the more general problem of anyons with statistics pa- 
rameter 6 in a magnetic field: 

H = ^-(p-e(a + A)) 2 + H int (20.43) 

where 

eV xa = 2(6-n) J^(r - r;) (20.44) 

i 

Here, we have represented the anyons as fermions interacting with a Chern- 
Simons gauge field. If we now replace this field by its spatial average, e V x 
(a) = 2(6 — ir)p, then this mean field theory is just the problem of fermions 
in an effective magnetic field 

eB cS = e V x (a + A) = eB + 2(6 - n)p (20.45) 

If there is a state of fermions in B e g with a gap, then the fluctuations about 
mean-field theory can probably be ignored. 

Suppose our anyons are actually fermions. Then, we can take 9 = ir and 
eB e g = eB. However, we could, instead, take 6 = (±2k + 1)tt, since this will 
give fermionic statistics as well. In such a case, eB e g = eB ± 27r(2k)p, or 
l/^eff = 1/^ ± 2k. Let us choose B e Q so that an integral number of Landau 
levels, n, are filled; this state will have a gap. Since f e ff = n, 

(20.46) 



2kn± 1 



For n = 1, this is just the Laughlin sequence. By exchanging real magnetic 
flux for the fictitious statistical flux of an auxiliary Chern-Simons gauge 
field, we have related the Laughlin states to a single filled Landau level. 

If we fix k = 1 and consider u e g = 1, 2, 3, . . . , n, we have v = |, |, |, . . . , 2 n+i • 
These are the filling fractions of the hierarchical sequence descending from 

„-I, 1 (20.47) 

3 5 1' 7 1 ' V ' 



2 1 

2 -2 
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Successive levels of the hierarchy are thereby related to states with additional 
filled Landau levels. In somewhat misleading, but ubiquitous, jargon, the 
fractional quantum Hall states of electrons are integer quantum Hall states of 
'composite fermions' [?]. The term 'composite fermion' refers to a composite 
object formed by an electron and an even number of flux quanta. This 
object fills an integer number of Landau levels of the remaining, uncanceled 
magnetic field. 

At this point, we have only shown that there are quantum Hall states 
obtained by the 'composite fermion' construction at the same filling frac- 
tions at which there are hierarchical states. It is not clear that the two 
different constructions yield states in the same universality class. That they 
do can be shown by demonstrating that both constructions lead to states 
with quasiparticles of the same charge and statistics and, hence, the same 
ground state degeneracy on a torus. We will show this in the next chapter 
using the field-theoretic descriptions of these states. 

Here we have considered only the simplest 'composite fermion' states. 
More complicated states can be constructed by introducing Chern-Simons 
gauge fields which only interact with electrons in particular Landau levels. 
Similar constructions are also available for spin-unpolarized and multi-layer 
systems. 

Jain [?] used the 'composite fermion' construction to motivate the fol- 
lowing trial states for the filling fractions v = 2fc " +1 : 



Vlll indicates projection into the lowest Landau level. The wavefunction 
^ n (zk) is the wavefunction of n filled Landau levels, so it has vanishing 
projection into the lowest Landau level, and will contain powers of Zi. How- 
ever, the factor rii>j( z « ~~ z j) 2k wm multiply this by many more powers of 
Zi. It may be shown that the resulting expression has large projection into 
the lowest Landau level. At an operational level, the lowest Landau level 
projection is accomplished by moving all of the factors of Zi to the left and 
making the replacement Z{ — > These wavefunctions have large overlaps 
with the exact ground states of systems with small numbers of particles. 

As we have seen, the mapping of an electron system at one filling fraction 
to a (presumably, weakly interacting) fermion system at a different filling 
fraction has shed considerable light on the fractional quantum Hall effect. 
This mapping has even proven to be useful starting point for a quantitative 




(20.48) 
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analysis. This mapping is a special case of the flux exchange process [?]: 
if we change the braiding statistics of the particles in a system and, at the 
same time, change the magnetic field, in such a way that 

2pA# = eAB (20.49) 

or, equivalently, 

A (|) - A (I) (20.50) 

then the properties of the system will not change, at the mean field level. If 
we assume that the fluctuations about mean-field theory are small, then the 
phase diagram of Figure 20.4 holds, with properties qualitatively unchanged 
along the diagonals [?]. 

In this way, we can map electron systems to other fermion systems, to 
Bose systems, or even to systems whose basic constituents are anyons. In 
the next chapter, we will see that the mapping from a fractional quantum 
Hall state to a Bose superfluid is the starting point for effective field theories 
of the quantum Hall effect. 

20.8 Edge Excitations 

In our discussion of the integer quantum Hall effect, we saw that there were 
necessarily gapless excitations at the edge of the system. The same is true 
in the fractional quantum Hall effect. To see this, let us consider again our 
simple Hamiltonian which annihilates the Laughlin state. All of the states 

V> = S( Zl , z 2 , • • • , z N )H { Zi - Zj ) m (20.51) 

i>j 

are also annihilated by H. In a more realistic model, there will be a confining 
potential V(r) which favors states of lower total angular momentum. The 
Laughlin state itself, with S = 1 is then the ground state, and the other 
states are edge excitations. They are spanned by: 

i> = X\(sn) Pn n^-^) m (20.52) 

n i>j 

where 



Sn = J>? 

i 



(20.53) 
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Figure 20.4: Systems at different points along the diagonals A = A 
in the magnetic field-statistics plane have the same properties at the mean- 
field level. 
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Suppose that i^disk = H + V con f. Then, to lowest order in the angular 
momentum, M, relative to the ground state: 

-ffdisk \P1,P2, ■■■} = f{M) \pi,P2, • • •) 

« (const.) M\p 1 ,p 2 ,...) 
= >^JP n n\pi,P2,---) 

n 

= v (^P n ~^j • • -> 

where v = 2ir\/L. This is the spectrum of a free bosonic field, but a chiral 
one, since only pi > are allowed. 

These bosonic excitations are simply the edge waves of an incompressible 
liquid. They will exist in any incompressible chiral fluid. To see how the 
edge excitations of a given quantum Hall state depend on the particular 
state, consider a quantum Hall state on an annulus, rather than a disk, so 
that there are inner and outer edges. The Laughlin state on an annulus can 
be described by: 

v>=n^ m n^-^-r (20.54) 

i i>j 

where t is a large integer so that the inner radius of the annulus is Iq \f2{tm + 1) 
while the outer radius is £q y / 2m(iV + t). Essentially, we have carved out the 
inner hole of the annulus by removing t electrons from the center of the disk. 
If we take a quasiparticle in the bulk and move it along a trajectory encir- 
cling the origin, it will not aquire a phase e 2mt = 1. 
We can now create edge excitations generated by 

Sn = Y, Z i ( 20 - 55 ) 

i 

for both positive and negative n, so long as n < t. We will take t ~ N so 
that both the inner and outer radii of the annulus are macroscopic; then for 
reasonable values of n, we will have excitations of both chiralities. Hence, 
the combined theory of both edges is a non-chiral boson. 

This theory has free bosonic excitations which are divided into m sectors, 
corresponding to states which are built by acting with the s ra 's on 



^ = n^n (* - **) 

i i>j 



(20.56) 
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where r = 0, 1, . . . , m — 1. Note that r = r + m by shifting i by one. These 
different sectors correspond to transferring a quasiparticle from the inner 
edge to the outer edge; sectors which differ by the transference of an in- 
eteger number of electrons from on edge to the other are equivalent. The 
different sectors may be distinguished by the phases which are aquired when 
quasiparticles encircle the origin, r = corresponds to periodic boundary 
conditions for quasiparticles. r ^ corresponds to 'twisted' boundary con- 
ditions for quasiparticles; they aquire a phase e 2mr / m upon encircling the 
origin. The sectors of the edge theory correspond to the m-fold degenerate 
ground states of the theory on a torus, as may be seen by gluing the inner 
and outer edges of the annulus to form a torus.. 



CHAPTER 21 



Effective Field Theories of the Quantum Hall Effect 



21.1 Chern- Simons Theories of the Quantum Hall 
Effect 

The preceding discussion has been heavily dependent on Laughlin's wave- 
functions. However, these wavefunctions are not the exact ground states of 
any real experimental system. Their usefulness lies in the fact that they 
are representatives of a universality class of states, all of which exhibit the 
fractional quantum Hall effect. What has been missing to this point is a 
precise sense of which properties of these wavefunctions define the univer- 
sality class, and which ones are irrelevant perturbations. We alluded earlier 
to the binding of zeroes to electrons. We will formalize this notion and use it 
to find low-energy, long-wavelength effective field theories for the fractional 
quantum Hall effect. One formulation of these effective field theories is in 
the form of a Landau-Ginzburg theory which is strongly reminiscent of su- 
perfluidity or superconductivity. One important difference, however, is that 
the order parameter is not a local function of the electron variables. This 
is not a trivial distinction, and it is, ultimately, related to the conclusion 
that a novel type of ordering is present in the quantum Hall states, namely 
'topological ordering.' 

Recall that in a superfluid or a superconductor the off-diagonal entries 
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of the density matrix: 

p(r,r') = (^t(r)V>(r')) = J dr 2 . . . dr N tp*(r, r 2 , . . . , r N ) ^(r', r 2 , . . . , r N ) 

(21.1) 

exhibit off-diagonal long-range order, 

p(r,r')-<f(r)<Xr') (21.2) 

for some non-zero function (f>(r). Feynman argued that the ground state 
wavefunction of a Bose fluid would have no zeroes, so it can be chosen 
everywhere real and positive. In the absence of phase variations, (21.2) 
will hold. As a result of (21.2), we can choose states of indefinite particle 
number such that {ip(r)) = <j>(r). <fi(r) can be treated as a classical field and 
used to analyze interference phenomena such as the Josephson effect. More 
importantly, off-diagonal long-range order is the hallmark of superfluidity. 

What happens if we calculate (21.2) in a Laughlin state state at v = 
l/(2fc + l)? 

p(r,r') = (^(r)^(r')) 

J i k>l 



This correlation function does not show any signs of long-range order. The 
fluctuating phases of the first two terms in the integral lead to exponential 
falloff. On the other hand, if we consider correlation functions of: 

tf(z) = e~ i( - 2k+1) /dVlmln(z- 2 ')y,t( z ) (21.4) 

this phase is removed and we find algebraic falloff of correlation functions, 
or quasi-long-range order, 

(WW*)) - k _ z ,p + D/ 2 (21-5) 

as may be shown using the plasma analogy [?]. The drawback of this order 
parameter is that it is not an analytic function of the z's, and, hence, is not 
a lowest Landau level operator. We could, instead, take: 

which not only has true long-range order, but also remains strictly within 
the lowest Landau level [?]. However, the field theory of this operator is 
more complicated, so we will use (21.4). 
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A Landau-Ginzburg theory may be derived for the order parameter 
(21.4) [?] in the following way. Begin with the Lagrangian for interacting 
electrons in a magnetic field such that v = l/(2k + 1): 

C eS = r (id " 4>) V> + V^V - A) v - 

+ V(x - x')ip\x)ip(x)ip ] (x')ip(x') (21.7) 

We now rewrite this in terms of a bosonic field, </>(x), interacting with a 
gauge field. The gauge field is given a Chern-Simons action of the type 
discussed two chapters ago, so that its only role is to transform the bosons 
<fi into fermionic electrons. 

£ eff = <f {ido - (oo + Aq)) 4> + ^ <f (iV - (a + A)f<t> 
-H$<l> + V(x - x)<j)\x)<p{x)<j)\x)(t){x) 

+ wni^' a - B ^ (2L8) 

Note that the coefficient of the Chern-Simons term is 2k + 1. We could 
have chosen any odd integer in order to obtain the correct statistics; the 
coefficient 2k + 1 is chosen for reasons which will become clear momentarily. 
To see that the correct statistics are obtained, note that the ao equation of 
motion is: 

V x a(r) = 2^(2A; + 1) p(r) (21.9) 

The Chern-Simons gauge field equation attaches 2k + 1 flux tubes to each <j> 
boson. As one boson is exchanged with another, it acquires an Aharonov- 
Bohm phase of (— l) 2fe+1 = —1 as a result of these flux tubes. 

As in the Landau-Ginzburg theory of a superconductor, long-range order 
in the bosonic field cj) - i.e. \<p\ 2 = p - breaks a U(l) symmetry. The Meissner 
effect results, i.e. a + A = 0, since a non-zero constant effective magnetic 
field V x (a + A) would lead to badly divergent energy in (21.8) if \<p\ 2 = p. 
This implies that B = -V x a = 2vr(2A; + l)p, or v = l/(2k + 1). 

Furthermore, excitations about this mean-field solution are gapped, as 
in the case of a superconductor, as a result of the Anderson-Higgs effect. 
This is, of course, what we expect for a quantum Hall state: there is a gap 
to all excitations. 

If B is increased or decreased from this value, vortices are created, as in 
a type II superconductor: 



<f>(r,e) = \cf>(r)\e 



id 
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with |0(O)| = 0, |0(oo)| = y/po, /(°°) = 0. These vortices are the Laughlin 
quasiholes and quasiparticles. They have one flux quantum of the a gauge 
field and very little real magnetic flux. As a result of the flux quantum of a 
which they carry, they have charge l/(2k + 1), according to (21.9). 

Essentially, the electrons have become bound to 2k+l flux tubes - or 2k+ 
1 zeroes as we put it earlier - thereby transmuting them into bosons in zero 
field. These bosons undergo Bose condensation; the fractional quantum Hall 
liquids are these Bose condensed states. Said slightly differently, the Chern- 
Simons gauge field, which satisfies Vxa(r) = (2k+l)p(r), has been replaced 
by its spatial average, V x (a(r)) = (2k + l)(/>). The average field cancels 
the magnetic field so that the bosons can condense. The fluctuations of a 
around its average value could, in principle, destabilize the Bose condensed 
state, but they do not because there is an energy gap. 

At finite temperature, there will always be some thermally excited quasi- 
particles, with a density ~ e _A//T , where A is the (finite) energy cost of a 
quasiparticle. The presence of these quasiparticles means that the quantum 
Hall effect is destroyed at any finite temperature: p xx ~ e~^l T . Howver, 
the deviation from the zero-temperature behavior is small for small T. 

Note that the Chern-Simons term results in an important difference 
between the Landau-Ginzburg theory of the quantum Hall effect and the 
Landau-Ginzburg theory of a superconductor. The Chern-Simons term at- 
taches flux to charges, so both particles (electrons) and vortices (quasipar- 
ticles) carry charge and flux. As a result, they are very much on the same 
footing; this can be made even more explicit, as we will see later. In a super- 
conductor, on the other hand, particles (Cooper pairs) carry charge while 
vortices carry flux; they are thereby differentiated. 

21.2 Duality in 2 + 1 Dimensions 

The Landau-Ginzburg theory which we have just discussed has a dual for- 
mulation which will prove useful in much of the following discussion. We will 
first consider duality more generally for a (7(1) theory in 2 + 1 dimensions 
and then consider the particular case of the quantum Hall effect. 
Consider a Landau-Ginzburg theory for a U(l) symmetry: 
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b + B = f(r) 




(21.10) 
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In chapter 11, we showed that such a theory could, in its broken symmetry 
phase, be simplified by writing ip = ■ v /pje* e ' and integrating out the gapped 
fluctuations of p s : 

C = ^Wf ( 2L11 ) 
This Lagrangian has a conserved current, d^j^ = given by 

j„ = p s d^e (21.12) 

We have assumed that there are no fluctuations in the amplitude. However, 
we can allow one type of amplitude fluctuations, namely vortices, if we allow 
to have singularities. Then the vortex current takes the form: 

j; = e^ x d u d x 9 (21.13) 

The conservation law (21.12) can be automatically satisfied if we take 

j/x = tnv\d v a x (21.14) 

Note that a\ is not uniquely defined, but is subject to the gauge transfor- 
mation a\ — > a\ + d^x- Equation (21.14) can be used to solve for and 
substituted into equation (21.13): 

duUu = Ps j; (21.15) 

where f^ v is the field strength associated with the gauge field a\: 

= d u a^ - d^a v (21.16) 

If we introduce a vortex annihilation operator, $„, then (21.15) is the equa- 
tion of motion of the dual Lagrangian: 

£ Du ai = \^\{d^-ia^ v \ 2 + Vbd^l) + ^-UuUu (21.17) 

where k is a vortex stiffness and V$ (|^u|) is the vortex- vortex interaction. 
The vortex density and current are given by 

3l = Tm{KQ* v (d li -ia li )* v } (21.18) 

The final term in the dual Lagrangian is a Maxwell term for the gauge field, 

f^ivf^iv = e — b 
ei = d ai - dia 
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b = eijdidj (21.19) 

which is of the same form as the action for the electromagnetic gauge field, 
F F — F 2 — R 2 

Notice that the conservation law (21.12) which followed from the equa- 
tions of motion in the original representation is a trivial, topological identity 
in the dual representation, following from (21.14). The definition (21.13) of 
the vortex current in the original representation is the equation of motion 
(21.15) in the dual representation. 

The broken symmetry phase of our original theory (21.11) is the phase 
in which = 0. Vortices are gapped; the low-energy effective action in 
the dual language is simply 

-CDual = „ ffiuffiu (21.20) 

The gauge field a M is the dual formulation of the Goldstone boson. However, 
when the symmetry is restored by the proliferation and condensation of 
vortices, (<&„) = <3?o / 0, the dual action is in its Higgs phase: 

k 1 
^Duai = 7T l $ o| 2 a^a^ + —f^uffiu (21.21) 



2 1 U| ^ 2p. 



s 



and the gauge field becomes massive. Hence, it is possible, by a duality 
transformation, to pass between an XY theory and a U(l) Higgs theory. 

The degrees of freedom of the scalar field 6 have a dual representation 
in terms of a gauge field a M . In 2 + 1 dimensions, a gauge field has one 
transverse component - i.e. one gauge-invariant degree of freedom per point 
-i.e. the same number of degrees of freedom as a scalar field. In 1 + 1 
dimensions, a guage field has no local degrees of freedom. Duality in 1 + 1 
dimensions connects two scalar field theories, as we will see later. 

Topological defects in the vortex order parameter, <I>, carry one quantum 
of a M flux. Hence, they are simply charges - e.g. Cooper pairs if the Landau- 
Ginzburg theory describes a superconductor. Hence, the duality operation 
exchanges particles and vortices. In the original representation (21.11), the 
Cooper pairs are the fundamental quanta while vortices are topological de- 
fects. In the dual representations, the fundamental quanta are vortices while 
the topological defects are Cooper pairs. 

Let us now extend this to transformation to the Chern-Simons theory of 
the quantum Hall effect. Suppose we consider this theory 

C cS = 4>* (ido - (ao + A))) 4> + ^ <f (»V - (a + A)) 2 
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+ V (M 2 ) + ^ Up ^a p (21.22) 

in its fractional quantized Hall phase. We write <f> = ^fpe %e and integrate 
out the gapped fluctuations of p: 

^cff =\ P (d»9 -a„- A^f + i- e^%cU p (21.23) 

This theory has a conserved current which is simply the electrical current: 

j il = p{d i fi-a iX -A v ) (21.24) 

We construct the dual representation of this current with a gauge field a: 

3n = £^\d v a\ (21.25) 

As in the derivation above, we consider vortices in the order parameter. As 
we saw at the beginning of this chapter, they are simply Laughlin quasipar- 
ticles and quasiholes. Their current is given by: 

jf = e^xdud x 9 (21.26) 

Using the dual expression for the current to eliminate 8^6 from the right- 
hand-side of this equation, we have: 

- hit*, = j; - e»v\dv (a„ + A„) (21.27) 

Ps 

This is the equation of motion of the dual Lagrangian: 

£ Dua i = \^ - ia^\ 2 + U$ (|<I> qp |) + y 9 Mv» 

+ ^ a^xdu (ax + A x ) + —e^'a^a,, (21.28) 

where $ is the vortex annihilation operator. Integrating out a„, which 
appears quadratically, we find: 

^Dual = ^«|(^-* a M)* qP | 2 + ^(l $qP D + 

2k + 1 1 1 

—Z-fTPandvCLp + — /^ + — ct^xdvAp (21.29) 

Since the Maxwell term for a p has one extra derivative compared to 
the Chern-Simons term, it is irrelevant in the long-wavelength limit. Let us 
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drop the Maxwell term and consider the effect of the Chern-Simons term. 
Since the quasiparticle annihilation operator is coupled to the Chern-Simons 
gauge field, a^, each quasiparticle has flux l/(2k + 1) attached to it. Hence, 
quasiparticles have statistics it /(2k + 1). According to the last term in 
(21.29), the external electromagnetic potential Aq is coupled to the fictitious 
flux eijdiCtj. Since each quasiparticle has flux l/(2k + 1) attached to it, it 
has charge e/(2k + 1). 

In the phase in which (<3? qp ) = 0, we can integrate out the quasiparticles, 
thereby renormalizing the Maxwell term. We can then integrate out a^, 

2k + 1 1 1 

= 2kTT A ^ d » A » ( 2L3 °) 

which leaves us with an effective action for the electromagnetic field which 
incorporates the Hall conductance l/(2k + 1). 

Hence, this duality transformation has transformed an action (21.22) in 
which the basic field (j) represents a charge e fermionic electron and the basic 
soliton is a charge e/(2/c+l), statistics ir/ (2/c+l) quasiparticle into an action 
(21.29) in which the basic field <J? qp represents a charge e/(2k + 1), statistics 
it/ (2k + 1) quasiparticle. To complete the correspondence, we must show 
that the basic soliton in (21.29) is a charge e fermionic electron. To do this, 
we must consider the state in which <J> qp condenses. 

When (<I? qp ) = <I> qp / 0, there are solitons in this state 

$ qp ( r ,#) = |$ qp (r)|e ie 
P + B = f(r) 

with |<I> qp (0)| = 0, |<& qp (cx))| = yfpo, /(oo) = 0. They carry one flux quantum 
of the gauge field, so they are fermionic, charge e particles - i.e. electrons 
are the solitonic excitations of the state in which <£ qp condense. 
When (3> qp ) = $ qp / 0, we have the effective action: 

k 2k 4- 1 1 

The first term gives a Higgs mass to the gauge field a^, which can now be 
integrated out. In doing so, we can neglect the Chern-Simons term, which 
is irrelevant compared to the Higgs mass. The resulting effective action for 
the electromagnetic gauge field is then 



C eS = -kqF^F^ (21.32) 
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In other words, the system is an insulator. The quantum Hall state with 
&xy = ve 2 /h is dual to the insulating state with a xy = which is formed 
when quasiparticles condense. 

Note that this insulating state is not the only state into which quasipar- 
ticles can condense. As we saw earlier in our construction of the hierarchy, 
quasiparticles can also condense into fractional quantum Hall states, thereby 
leading to a xy = (2k + 1 ± l/p)~ l e 2 /h. Hence, the hierarchy construction 
is simply a variant of duality in 2 + 1 dimensions, as we discuss in the next 
section. In this section, we have considered the 'usual' case, p = 0. 



21.3 The Hierarchy and the Jain Sequence 

To construct the hierarchy, let us begin with the dual theory (21.29). 

C^ = \n\(d li -ia^\ 2 + V*(|^|) + 

2k + 1 1 1 

4?r ^ up a l id l/ a p + —f l tvf ll u + — a ll e flu xd u A fl (21.33) 

Let us now introduce another gauge field, /, which attaches an even 
number of flux tubes to the quasiparticles and, therefore, has no effect. In 
other words, (21.29) is equivalent to the effective action 

C Dua , = ^\(d,-ia,-if,)^\ 2 + ^(|«l qp |) + 
2k + 1 1 1 

-■kh^" >u&vU (2L34) 

Let's suppose that <E> condenses. Then the corresponding Meissner effect 
requires that: 

Vxa + Vxf = (21.35) 
Meanwhile, the ao and fo equations of motion are: 

(2k + 1) V x a + V x A = 2tt p qp 

V x f = -27r(2p)p qp . (21.36) 

Combining these three equations, 

2k + l- — )Vxa + VxA = (21.37) 
2pJ 
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Since V x aequiv2irp, we find v = l/(2k + 1 — ^). Continuing in this way, 
we can find the Landau-Ginzburg theories for all of the hierarchy states. 

Let's now consider the 'composite fermion' construction for the 'Jain se- 
quence' v = n/(2pn± 1). We represent the electrons as fermions interacting 
with a Chern-Simons gauge field c M which attaches 2p flux tubes to each 
fermion. The Lagrangian can then be written: 

h 2 

C cS = ip* (ido - c - Aq) ip + — ^*(iV - c - Kfip 
-/i^ijj + V(x - x')ift (x)ip(x)^ (x')ip(x') 

~hh^^ (2L38) 



The flux of c M is anti-aligned with the magnetic field so the effective magnetic 
field seen by the fermions is V x (c + A), which is such that the ^'s - the 
composite fermions - fill n Landau levels in the effective magnetic field. 

To derive the effective theory for this state, we must now construct the 
effective theory for n Landau levels. At v = 1/m, we introduced a Chern- 
Simons gauge field so that we could represent each electron as a boson 
attached to m flux quanta. At v = n, it is not useful to introduce a single 
Chern-Simons gauge field, which would allow us to represent each electron 
as an anyon attached to 1/n flux quanta. Instead, it is more useful introduce 
n gauge fields, each of which is coupled to the electrons in one of the Landau 
levels. We can then represent each electron as a boson attached to one flux 
quantum. The problem with such an approach is that we can only introduce 
n gauge fields if there are n different conserved quantities, namely the charge 
in each Landau level. These charges are not, in general, conserved: only the 
total charge is conserved. We will come back to this point later, and assume 
for now that this will not make a difference. 

Then, the Lagrangian takes the form: 

h 2 2 
Atf = 4>*i (ido -ai-c - A ) 0/ + — - $ (iV - a 1 - c - A) fa 

-Hfifa + V(x - a/)<l>%x)M$<l>*j&)<l>J&) 

- k h + h (21-39) 



where <pi annihilates a boson corresponding to an electron in the I Landau 
level. 
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21.4 K- matrices 

A compact summary of the information in the Landau-Ginzburg theory is 
given by the dual theory. Consider the Landau-Ginzburg theory for u = 
l/(2fc+l): 

C cS = <P* (id - (a + A))) cf>+^ 4>*(iV - (a + A)) 2 cb+u^ + ^e^a^a p 

(21.40) 

Let's apply 2 + 1-dimensional duality to this Lagrangian, following (21.22)- 
(21.29). We find the dual theory: 

^Dual = ^«|(^-^)* qP | 2 + Ml^l) + 

2k + 1 1 

e^a^ap + —f^f^ + a^ vX d v Ap, (21.41) 

or, keeping only the most relevant terms, simply 
2k + 1 

£duai = -^^e^a^ap + A^ vp d v a p + a^ ortcx + £ vortcx (21.42) 

where £ V ortcx is the quasiparticle effective Lagrangian. Let's assume that 
vortices are gapped, but allow for the possibility that the magnetic field is 
not quite commensurate with the density so that there is some fixed number 
of pinned vortices. Then, we can drop the last term, £ V ortex- 

This generalizes to an arbitrary abelian Chern-Simons theory: 

£duai = ^ Kjj e^afaaj + ^A^dWp + "Xrtex I ( 21 -43) 

The Hall conductance of such a state can be obtained by integrating out the 
Chern-Simons gauge fields, which appear quadratically: 

a H = Yyt^K-^j (21.44) 
I, J 

The charge of a vortex (i.e. a quasiparticle) of type i is: 

qj = YJ^K'^u (21.45) 
J 

and the braiding statistics between vortices of types i and j is: 

9u = (K- l ) u . (21.46) 
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Implicit in the normalizations is the assumption that the charges associated 
with the i„ vor tex are quantized in integers. Distinct quantum Hall states are 
therefore represented by equivalence classes of (K, t) pairs under SL(k,Z) 
basis changes where k is the rank of the .KT-matrix. 

Let's now construct the if-matrices associated with the hierarchy and 
the Jain sequence. First, consider the Landau-Ginzburg theory (21.34) of a 
hierarchy state. 

1 K \(d lt -ic i fl -in^\ 2 + v($ q p) 

1 1 ■ 6^/^/^21.47) 



Oh _|_ 1 1 
47T ^ p Z7T 



4ir 2p 



where c* 
p 



a M . We write <£ qp 



$ < 3P|e^q P) integrate out the gapped fluctu- 
ations of |0qp|, and apply steps (21.22)-(21.29) to (21.34) by introducing a 
gauge field, c^: 



e^ X d v cl 



^p^qpHS'Vqp 



/" 



(21.48) 



We use the 2 + 1-dimensional duality transformation to substitute this into 
(21.34). 



2 A- -I- 1 1 

47T C p°W + 27T C M^ A ^ 



(21.49) 



Finally, we integrate out / M : 

2fc + 1 



rP^P 



c u d u c + p c u d u c 



4vr " " ' 4tt 

_|_ J_ ^P,- 1 r 2 , J_ M^P^ 1 fj A 



Hence, a state at the first level of the hierarchy has 

'2k + 1 1 



Pi 



(21.50) 



(21.51) 



and i J = (1,0). 

Continuing in this fashion, we find the i^-matrix of an arbitrary hierarchy 
state (20.42): 

/2fc + l 1 ...\ 

1 p! 1 

1 J>2 1 

1 p 3 



V 



(21.52) 



7 
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and t 1 = 8[. 

Let's now consider the flux-exchange construction of the Jain sequence. 
Starting with (21.39), we write = \§i\e Vl % integrate out the gapped fluc- 
tuations of 1 and apply steps (21.22)-(21.29) to (21.39) by introducing 



gauge fields, a^. 

e^ x d u a{ = J^ = \ ( j )I \ 2 (#Vj -a 1 "-^- A") 
Using 2 + 1-dimensional duality, we re-write this as 



(21.53) 



+ _L f^Pn 1 d a 1 — — fWr d c 

4tt ' i- 2 /( " ' '' 



(21.54) 



Integrating out a 1 ^ and c^, and dropping the subleading Maxwell terms we 
find 



1 

+ 2^ 



(21.55) 



In other words, the flux exchange construction of the Jain sequence is sum- 
marized by the if-matrix: 



K 



Jain 



(2p + l 
2p 
2p 
2p 



2p 
2p+l 
2p 
2p 



2p 
2p 
2p+l 
2p 



2p 
2p 
2p 
2p+l 



...\ 



'■■/ 



(21.56) 



and t 1 = (1,1,... ,1). 

If we make the change of basis K h = W T K Jain W and t h = W' 1 ^, 
with W = 5ij — 5i+i t j, then (21.56) is transformed into (21.52) with 2/c+l = 
2p+l andpi = p2 = ■ ■ ■ = 2. Meanwhile, t Jam is transformed into t h . Hence, 
the two constructions are identical for the corresponding filling fractions. 

The if-matrix formalism also applies to some quantum Hall states which 
we have not yet discussed. These include double-layer quantum Hall states 
- in which there are two parallel layers of electrons - and spin-unpolarized 
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systems. Although, we have thus far assumed that the electrons are spin- 
polarized by the magnetic field, band mass and g factor corrections make the 
ratio of Zeeman to cyclotron energies ~ 7/400, so that it may be necessary to 
include both spins when describing electrons in moderately strong magnetic 
fields, even when the filling fraction is less than unity [?]. An example of a 
wavefunction which can describe spin-polarized electrons in a double-layer 
system or spin-unpolarized electrons in a single-layer system is the (m,m,n) 
wavefunction: 



The u>j's and Zj's are, respectively, the coordinates of the up and down 
spin electrons, or upper- and lower-layer electrons and the filling fraction is 
v = ^p^- The notation of (21.57) is sloppy; (21.57) should be multiplied 
by the spin part of the wavefunction and antisymmetrized with respect to 
exchanges of up- and down-spin electrons. The i^-matrix for this state is 



and t 1 = t 2 = 1. By considering hierarchies built on the (m,m,n) states 
or states of unpolarized electrons in multi-layer systems, we can imagine a 
cornucopia of fractional quantum Hall states specified by K matrices. 

What exactly do we mean when we say that a Chern-Simons theory 
such as (21.43) is the low-energy effective field theory of a quantum Hall 
state? Let us first imagine that our quantum Hall liquid is on a compact 
surface such as a sphere or a torus, rather than in some bounded region of 
the plane as it would be in a real experiment. The Hamiltonian of (21.43) 
vanishes, so every state in the theory has vanishing energy. In other words, 
the Chern-Simons theory is a theory of the ground state(s). This includes 
states with - essentially non-dynamical - quasiholes and quasiparticles at 
fixed positions, since they are the lowest energy states at a given filling 
fraction. This theory is only valid at energies much smaller than the gap 
since it ignores all of the physics above the gap. The leading irrelevant 
corrections to (21.43) are Maxwell terms of the form [d v a^ — d^a v ^ which, 
by dimensional analysis, must have a coefficient suppressed by the inverse 
of the gap. The quasiparticle charges (21.45) and statistics (21.46) are the 
essential physics of the ground state which is encapsulated in this theory. 
This is not all, however. On a surface of genus g, even the state with no 
quasiparticles is degenerate. Two chapters ago, we saw that a Chern-Simons 




(21.58) 
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with coefficient 2k + 1 has a 2k + 1-fold degenerate ground state on the 
torus. This is precisely the ground state degeneracy which we obtained in 
the previous chapter by adapting the Laughlin wavefunctions to the torus. 
This can be generalized to an arbitrary quantum Hall state by diagonalizing 
its K-matrix and multipying the degeneracies of the resulting decoupled 
Chern-Simons terms or; in other words, the degeneracy is simply det-ftT. 
On a genus-<? surface, this becomes (detK) 9 [?]. Since numerical studies 
can be - and usually are - done on the sphere or torus, the degeneracy 
is an important means of distinguishing distinct quantum Hall states with 
different ET-matrices at the same filling fraction. 



21.5 Field Theories of Edge Excitations in the Quan- 
tum Hall Effect 

If, instead, we look at the Chern-Simons theory (21.43) on a bounded region 
of the plane [?], then the variation of the action S = f C is: 

5S= h j d3xK u sa y i/pd v aJ p 

+ I dtdxn u (Ku e^'oj Sa J p + ije^M^) (21.59) 

*R ./boundary 

if we set A p = j M = 0. The action is extremized if we take e pup d u a J p = sub- 
ject to boundary conditions such that Kija 1 ^ vp Sa p = at the boundary. 
Let us suppose that x and y are the coordinates along and perpendicu- 
lar to the boundary. Then, the most general such boundary condition is 
Kijolq + Vija^, = 0. Vjj is a symmetric matrix which will depend on the 
details of the boundary such as the steepness of the confining potential. 
Clearly, KjjOq + Vijol\. = would be a sensible gauge choice since it is 
compatible with the boundary condition. In this gauge, the equation of mo- 
tion following from the variation of Kjjal + Vija^ in (21.43) is a constraint, 
which can be satisfied if a{ = d^ 1 for some scalar field <fi. Substituting this 
into the Lagrangian and integrating by parts, we find that all of the action 
is at the edge: 

S= h I dtdxK u d t4> 1 9A J -Vud^ 1 d x( j) J + A fl e flv d v <p I t I (21.60) 

The Chern-Simons theory of the bulk has been reduced to a theory of (chiral) 
bosons at the edge of the system. These are precisely the excitations which 
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of the Edge 



Figure 21.1: The Chern-Simons theory which describes the braiding of quasi- 
particles in the bulk is associated with a Conformal Field Theory which 
describes the gapless excitations at the edge. 

we derived by multiplying the Laughlin state by symmetric polynomials in 
the previous chapter. 

Let's consider the simplest case, v = 1/m 

S=— I dt dx dt4> d x (f> — v d x (f> d x (f> + — / dtdx A^e^v^v^t 1 (21.61) 
2tt J 2ir J 

This is the action for a free chiral boson. The equations of motion (for 
A^ = for simplicity) 

{d t - vd x ) d x <f> = (21.62) 

are satisfied if the field is chiral, cf>(x, t) = <f>(x + vt). 

The equal-time commutations relations which follow from this action 

are: 

2-7T 

[d x <f>{x),</>{xf)] =i — 5{x- x') (21.63) 

By varying the electromagnetic field, we derive the charge and current op- 
erators: 
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2vr 



dt<, 



(21.64) 



Hence, the operators e*^ and e irn ^ create excitations of charge 1/m: 



— S(x - x') 



m 



5(x — x) 



(21.65) 



These operators create quasiparticles and electrons respectively. 

To compute their correlation functions, we must first compute the (f> — 4> 
correlation function. This is most simply obtained from the imaginary-time 
functional integral by inverting the quadratic part of the action. In real- 
space, this gives: 



(<P(x,t) 0(0,0)) -(^(0,0)0(0,0))= / 



1 



2-7T 2-7T m k{iuj — vk) 
^ 2tt r dk 1 / _ ifc(x+iwr) _ \ 
" m J 2vr k\ J 



giuiT—ikx 



1 

m 



dk 1 

2tt A; 



= In [(x + ivr)/a\ 



rn 



(21.66) 



where a = 1/A is a short-distance cutoff. Hence, the quasiparticle correla- 
tion function is given by: 



e i<Hx,T) e i<Ho,o) 



3 (^,t)#,0))-(^(0,0)^(0,0)> 



(a; + iwr) 1 /™ 
while the electron correlation function is: 



(21.67) 



e im<P(x,r) e im<P(0,0)\ = e m 2 {<l>(x,t) 0(0,0)) -m 2 (0(0,0) 0(0,0)) 



(x + tot) 



(21.68) 



Hence, the quasiparticle creation operator has dimension 2/m while the 
electron creation operator has dimension m/2. 

Let us suppose that a tunnel junction is created between a quantum 
Hall fluid and a Fermi liquid. The tunneling of electrons from the edge of 
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the quantum Hall fluid to the Fermi liquid can be described by adding the 
following term to the action: 

S tnn = t J dre im ^°' T ^(0,r) + c.c. (21.69) 

Here, x = is the point at which the junction is located and ip(x,r) is the 
electron annihilation operator in the Fermi liquid. As usual, it is a dimension 
1/2 operator. This term is irrelevant for m > 1: 

2 = 5<i-»0« Pi-™) 

Hence, it can be handled perturbatively at low-temperature. The finite- 
temperature tunneling conductance varies with temperature as: 

G t ~ t 2 T m ~ 1 (21.71) 

while the current at zero-temperature varies as: 

It ~ t 2 V m (21.72) 

A tunnel junction between two identical quantum Hall fluids has tunnel- 
ing action: 

S tun = t J dTe imM °' T) e - im ^(0,r) + CQ ^73) 

Hence, 

| = (l-m)i (21.74) 

and the tunneling conductance varies with temperature as: 

G t ~t 2 T 2m - 2 (21.75) 

while the current at zero-temperature varies as: 

It ~ t 2 V 2 ™- 1 (21.76) 

Suppose we put a constriction in a Hall bar so that tunneling is possible 
from the top edge of the bar to the bottom edge. Then quasiparticles can 
tunnel across the interior of the Hall fluid. The tunneling Hamiltonian is: 

St mi = v J d re ^i(M e -^2(0,r) + c c (2L77) 
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The tunneling of quasiparticles is relevant 




(21.78) 



where <p\ and 02 are the edge operators of the two edges. Hence, it can 
be treated perturbatively only at high-temperatures or large voltages. At 
low voltage and high temperature, the tunneling conductance varies with 
temperature as: 

G t ~v 2 T™- 2 (21.79) 
while the current at zero-temperature varies as: 

I t ~v 2 V^- 1 (21.80) 

so long as V is not too small. If we measure the Hall conductance of the bar 
by running current from the left to the right, then it will be reduced by the 
tunneling current: 

1 e 2 2 

G = : (const> 2 T™- 2 (21.81) 

m h 

When T becomes low enough, the bar is effectively split in two so that all 
that remains is the tunneling of electrons from the left side to the right side: 

G~t 2 T 2m ~ 2 (21.82) 

In other words, the conductance is given by a scaling function of the form: 

G = — ^ Y (v 2 T^- 2 ) (21.83) 
m h V / 

with Y(x) — 1 ~ —x for x — > and Y(x) ~ x~ m for x — > oo. 

For a general i^-matrix, the edge structure is more complicated since 
there will be several bosonic fields, but they can still be analyzed by the 
basic methods of free field theory. Details can be found in . 

21.6 Duality in 1 + 1 Dimensions 

At the end of the previous section, we saw that a problem which could, in 
the weak-coupling limit, be described by the tunneling of quasiparticles was, 
in the strong coupling limit, described by the tunneling of electrons. This 
is an example of a situation in which there are two dual descriptions of the 
same problem. In the quantum Hall effect, there is one description in which 
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electrons are the fundamental objects and quasiparticles appear as vortices 
in the electron fluid and another description in which quasiparticles are 
the fundamental objects and electrons appear as aggregates of three quasi- 
particles. We have already discussed this duality in the 2 + 1-dimensional 
Chern-Simons Landau-Ginzburg theory which describes the bulk. In this 
section, we will examine more carefully the implementation of this duality 
in the edge field theories. As we will see, it essentially the same as the 
duality which we used in our analysis of the Kosterlitz-Thouless transition. 
In the next section, we will look at the analogous structure in the bulk field 
theory. 

Let us consider a free non-chiral boson <p. It can be expressed in terms 
of chrial fields 4>l and 4>r: 



<P = 4>L + 4>R 
<P = <pL-4*R 

Here, we have defined the dual field (p. Observe that: 



9^ 



d v ip 



The free action takes the form: 



S 



g_ 

8vr 



dx dr 



(d T vf + v 2 {d x <py 



(21.84) 



(21.85) 



(21.86) 



where ip is an agular variable: if = if + 2tt. Let us rescale <p> — ► p/s/g so 
that the action is of the form: 



So 



1 

8^ 



dx dr 



(d T <p) +v 2 {d x ipY 



(21.87) 



As a result of the rescaling of ip which we performed in going from (21.86) 
to (21.87), ip now satisfies the identification tp = p> + 2-Ky/g. 
Note that this theory has a conserved current, <9 M j M = 

j„ = 8^ (21.88) 

which is conserved by the equation of motion. It also has a current 

J? = drf (21.89) 



which is trivially conserved. 
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Let us consider the Fourier decomposition of 4>r,l'- 

Mx, r) = \x^+ PR {it -x) + iJ2-a n e- n{r+ix) 

n 

<f) L (x, t) = \x^+ Pl (it + x) + iJ2-a n e- niT ~ tx) (21.90) 

— Th 

n 



Hence, 

<p(x, t) = <p + i (p L + Pr )t + {pl~Pr)x + [a n e~ n(T+ix) + a n e~ n( - T - ix) 

n 

<p{x, t) = ifo + i {PL -Pr)t + (p L + Pr) x - [a n e~ n{T+ix) - a n e~ n{T ~ ix) 

n 

(21.91 



where ipo = Xq + x R and ipo = Xq — x R . From the identification ip = 
(p + 2iry/~g, it follows that <po = (pQ-\-2ir^/~g. From the canonical commutation 
relations for 99, it follows that ipo and (pl + Pr) /2 are canonical conjugates. 
The periodicity condition satisfied by 930 imposes the following quantization 
condition on (p^ + Pr)' 

M 

PL+PR = ^, M£Z (21.92) 

Furthermore, physical operators of the theory must respect the periodicity 
condition of ip. The allowed exponential operators involving ip are of the 
form: 

{M, 0} = e ^ (:r ' T) (21.93) 

and they have dimension M 2 /g. 

Let us assume that our edges are closed loops of finite extent, and rescale 
the length so that x G [0, n] with ip(r, x) = ip(r, x + it) + 2irN \fg for some 
integer N. Then, from (21.91), we see that we must have 

( P l-Pr) = 2N^, NeZ (21.94) 

These degrees of freedom are called 'winding modes'. Hence, we have: 

M ,r r- 

« = Vf + Ar V5 

PR = ^= - WVS (21-95) 
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Note that this is reversed when we consider (p. Momentum modes are 
replaced by winding modes and vice-versa. Following our earlier steps, but 
taking this reversal into account, the allowed exponentials of <p are of the 
form: 

{0,N} = e i2N ^^ x ' T ^ (21.96) 
Hence, the most general exponential operator is of the form: 

{M,N}^e<^ +N ^^ =e 4(^ +Jv ^)^ + (^- Jv ^)H (21.97) 
with scaling dimension: 

1 / M \ 2 1 / M \ 2 

= <M!V + N* 9 (21.98) 
9 

These dimensions are invariant under the transformation g <-> l/4p, M <-> 
iV. In fact the entire theory is invariant under this transformation. It is 
simply the transformation which exchanges (p and (p. 

When we couple two identical non-chiral bosons, <*p\ and (p2, we form 
ip± = (ipi ± </?i)/\/2. The factor of \[2 is included so that both ip± have the 
same coefficent, 1/8vt, in front of their actions. However, this now means 
that (f± = <p± + 2ny/g/2. When we couple two bosons though exponential 
tunneling operators, only <p_ is affected. Hence, the appropriate duality is 
that for tp^: (g/2) <-> l/[4(^/2)] or, simply g <-> 1/g. This duality exchanges 
cos if- 1 y/g/2 and cos tp- \fg~/2, which transfer, repectively, a pair of solitons 
(i.e. electrons) and a particle-hole pair from system 1 to system 2. 

Let us now apply these considerations to quantum Hall edges. In order to 
apply the above duality - which applies to non-chiral bosons - to a quantum 
Hall edge, which is chiral, we must 'fold' the edge in order to define a non- 
chiral field, as depicted in figure 21.2. 

If we fold the edge at x = 0, we can define (p = (4>(x) + <fi(—x))/\/2 and 
ip = (4>(x) — 4>{—x))/\/2. The latter vanishes at the origin; only the former 
is important for edge tunneling. The allowed operators are: 



e iNifi/^/m 



N 2 

(21.99) 

2m v ; 
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Figure 21.2: An infinite chiral edge mode with a tunnel junction at one point 
can be folded into a semi-infinite nonchiral mode with a tunnel junction at 
its endpoint. 



The factor of 1/2 on the right-hand-side comes from the y/2 in the definition 
of ip. If we couple two edges, we can now define p>-, which has allowed 
operators 

N 2 

— (21.100) 

m 

and dual operators 

= M 2 m (21.101) 

which are dual under M <-> N, m <-> 1/m. The description in terms of if 
is equivalent to the description in terms of (p. However, as we saw in the 
previous section, the tunneling of quasiparticles between the two edges of 
a quantum Hall droplet is most easily discussed in terms of ip when the 
tunneling is weak, i.e. in the ultraviolet, when the tunneling operator can 
be written e %L Pl^™ . However, when the tunneling becomes strong, in the in- 
frared, the dual description in terms of <p is preferable, since the corresonding 
tunneling operator is e lC P^" 1 . 




CHAPTER 21. EFFECTIVE FIELD THEORIES OF THE QUANTUM 
384 HALL EFFECT 



CHAPTER 22 



Frontiers in Electron Fractionalization 



22.1 Introduction 



At present, the only topological phases which are know to occur in nature 
are those which are observed in the fractional quantum Hall regime. As de- 
scribed in the previous chapter, a number of experimentally-observed phases 
are associated with various Abelian topological states. There is some rea- 
son for believing that there are also non- Abelian topological phases lurking 
in some relatively weak quantum Hall plateaus in the second Landau level. 
Furthermore, there is nothing about topological phases which is intrinsic to 
the quantum Hall regime. In principle, they can occur in a number of dif- 
ferent physical contexts and, in fact, an even wider variety of phases (such 
as those which are P, T-invariant) might appear as we explore the full free- 
dom of the phase diagram of electrons in solids. This is a frontier topic, so 
our discussion will necessarily be tentative. Since very little is known on 
the experimental side, our discussion will be rather speculative, but we will 
try to stick to topics where at least the mathematical and theoretical issues 
are clear. In particular, we will focus on the effective field theories which 
describe various topological phases. We will also try to briefly address the 
question of which particular Hamiltonians of electrons in solids will actually 
give rise, but this is still very much an open question. 



385 



38EHAPTER 22. FRONTIERS IN ELECTRON FRACTION ALIZATION 



22.2 A Simple Model of a Topological Phase in 
P, T-Invariant Systems 

Consider the following simple model of spins on a honeycomb lattice. Each 
s = 1/2 spin lies on a link of the lattice. The spins interact through the 
Hamiltonian: 

i v 

where 

A = ^k&M(i) a k > f p = n^epO-fc (22.2) 

and cr^ = ±. These operators all commute, 

[F p , F pl ] = [A*, Aj] = [F p , Aj] = (22.3) 

so the model can be solved exactly by diagonalizing each term in the Hamil- 
tonian: the ground state |0) satisfies Ai\0) = —\0),F p \0) = |0). If we rep- 
resent a z = 1 by colored bonds and a z = —1 by uncolored bonds, then 
A{\0) = — 10) requires chains of bonds to never end, while F p \0) = |0) re- 
quires the ground state to contain an equal superposition of any configura- 
tion with one obtained from it by flipping all of the spins on any plaquette 
(i.e. switching colored and uncolored bonds). 

To understand this Hamiltonian better, it is useful to consider the J\ — ► 
oo limit and to introduce the following representation for the low-energy 
states of this model. We consider wavefunctions ^f[a] which assign a complex 
amplitude for any 'multi-loop' a on the honeycomb lattice. By 'multi-loop', 
we mean simply the disjoint union of a number of closed loops which do 
not share any links of the lattice. The multi-loop a simply represents the 
locations of the o% = +1 spins, so this representation is just the a| basis. 

The ground state condition, F p \0) = |0) is the statement that ^[a] is 
invariant under various geometrical manipulations of the multi-loop a. If 
we draw the multi-loops as if they were in the continuum, but remember 
that they are really on the lattice, then we can draw the relations imposed 
by F p \0) = |0) in the following pictorial form as three distinct relations. 
Depending on whether the plaquette is empty 

The first follows from F p acting on a empty plaquette. It flips this 
plaquette into one which contains a small, contractible loop. Hence, the 
ground state wavefunction is invariant under such an operation, as depicted 
in figure 22.1. Suppose now that a loop passes through a single link of a 
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Figure 22.1: The ground state wavefunction is invariant under the removal 
of a small contractible loop. 

plaquette. When F p acts on this plaquette, it deforms the loop so that it 
passes through the other 5 links of the plaquette and now encloses it. The 
action of F p causes similar deformation of a loop which passes through a 
plaquette along 2, 3, . . . , 5 consecutive links. Hence, the ground state must 
be invariant under such a deformation of any loop, as depicted in figure 22.2. 
If two loops touch a plaquette, then F p cuts the loops and rejoins them so 




Figure 22.2: The ground state wavefunction is invariant under smooth de- 
formation of any loop. In the figure, the loop in the upper left has been 
deformed. 

that they form one big loop. Conversely, if the same loop passes through a 
plaquette twice, then F p breaks it into two loops. The ground state must be 
invariant under such surgery operations, as depicted in figure 22.3. If three 




Figure 22.3: The ground state wavefunction is invariant under a surgery 
procedure which cuts and rejoins two loops which come near each other. 

loops touch a plaquette, then F p performs a surgery which is equivalent to 
two pairwise surgeries, which can be performed in any order. 

Now consider this model on an annulus. It is now possible for loops 
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to wind around the annulus. However, the surgery relation implies that 
ground state wavefunctions must have the same value on configurations with 
winding numbers 0,2,4,.... Similarly, they must have the same value on 
configurations with winding numbers 1, 3, 5, . . .. Thus, there are two ground 
states, corresponding to even and odd winding numbers of the loops around 
the annulus. On the torus, there are four ground states, corresponding to 
even/odd winding numbers around the two generators of the torus. Thus, 
the ground state degeneracy depends on the topology of the manifold on 
which the system is defined - on a genus g surface, it is 4 g . Note, further, 
that the different ground states are locally indistinguishable. They all have 
Ai = — 1, F p = 1 at every vertex and plaquette, so correlation functions of 
local operators are all the same (and vanish unless the two operators share 
a plaquette). The only way to distinguish the various ground states is to 
do a measurement which is sensitive to the global topology of the system. 
Thus, it is in a topological phase. 

Consider now the excited states of the system. They are collections of 
localized excitations which come in two varieties: vertices at which A, L = \ 
and plaquettes at which F p = — 1. In other words, we have excitations which 
are endpoints of broken loops, which we can think of as 'electric' particles. 
Clearly, they can only be created in pairs. We also have excitations which 
are frustrated plaquettes: the state acquires a minus sign whenever a loop 
moves through this plaquette. We can think of these excitations as vortices 
or 'magnetic' particles. 



Figure 22.4: When an i„ = -1 electric excitation is taken around an F p = 
— 1 magnetic excitation, the wavefunction acquires a —1. 

Now, observe that when an electric particle is taken around a magnetic 
particle, we must move the curve attached to it though the excited plaquette, 
so the wavefunction acquires a phase w, as depicted in figure 22.4. Hence, 
electric and magnetic particles have non-trivial mutual braiding statistics. 
On the other hand, electric particles have bosonic statistics with themselves, 
as do magnetic particles. A composite formed by an electric and a magnetic 
particle is fermionic. 
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22.3 Effective Field Theories 

The basic physics of the Hamiltonian (22.1) is the relative statistics between 
the 'electric' and 'magnetic' particles. This is encapsulated in the following 
topological field theory: 

SbF = 7 LJ Sx dT e ^ e ^ - " ir^^ ( 22 - 4 ) 

where f v \ = d v a\ — d\a u , as usual. This theory is commonly known as 
'BF theory' because the field is usually called b^. However, is better 
notation because it is the canonical conjugate of a^, as may be seen from 
(22.4): 

— = e ijej [ai(x),e kj ej(0)] = iS ik S(x) (22.5) 

The time derivatives of the fields eo and ao do not appear in this action. 
They are Lagrange multipliers which enforce the constraints: 

f 12 = ^magnetic 

d 1 e 2 -d 2 e 1 =7rp clcctric (22.6) 

Thus, each magnetic particle is accompanied by 7r-flux of the gauge 
field. Electric particles 'see' the gauge field according to their coupling 
j^ lectric a M , so when an electric particle goes around a magnetic particle, the 
wavefunction changes sign. Conversely, each electric particle is accompanied 
by 7r-flux of the field, so when a magnetic particle goes around an electric 
particle, the wavefunction changes sign. 

We can quantize this theory along the lines of our quantization of Chern- 
Simons theory, to which it is related according to 

2S BF [e, a] = S C s [a + e] - Scs [a - e] (22.7) 

By introducing Wilson loop operators for a„, and quantizing their algebra, 
we can see that the theory (22.4) has ground state degeneracy 4 on the 
torus. Rather than recapitulating this, we will see how this structure arises 
in another version of the theory below. 

The transition into this phase can be understood as the deconfinement 
transition of Z 2 gauge theory, whose deconfined phase is described at low 
energies by the U(l) BF theory above. There are two different ways of real- 
izing such a theory. We could begin with a U(l) gauge theory with Maxwell 
action which is coupled to a charge-2 matter field. When this matter field 
condenses, the U{\) symmetry is broken to Z 2 . This construction can be 
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done directly in the continuum. Alternatively, one can work with Z2 gauge 
fields from the beginning. However, one must, in such a case, work on a 
lattice. Let us follow the latter avenue. We consider a 2 + 1 dimensional 
space-time lattice on which there is an Ising gauge field degree of freedom 
a z = ±1 on each link of the lattice. We will label them by a lattice site, x, 
and a direction i = x,y,T so that there are three links associated with each 
site. The action is the sum over all plaquettes of the product of a z s around 
a plaquette: 

S = -K OzO z o z o z (22.8) 
plaq. 

To quantize this theory, it is useful to choose temporal gauge, <r 2 (x, r) = 1 
for all x. In this gauge, the Hamiltonian takes the form: 

H = -^<T x (x,i) - K ^ o z o z a z a z (22.9) 
x '* spatial plaq. 

In temporal gauge, there are residual global symmetries generated by the 
operators 

G(x) = CT x (x,x)(7 x (x,y)cr x (x-x,:E)<7 x (x-y,y). (22.10) 

The extreme low-energy limit, in which this theory becomes topological, 
is the K — > 00 limit. In this limit, a z a z a z a z = 1 for every spatial plaquette. 

It is useful to define operators W[j] associated with closed curves 7 on 
the lattice: 

L[l] = II^(x,i) (22.11) 

x,ie7 

We also need operators Y[a] associated with closed curves on the dual lat- 
tice, i.e. closed curves which pass through the centers of a sequence of 
adjacent plaquettes. 

Y[a] = [J v x (x,i) (22.12) 

x,i_L« 

The product is over all links which a intersects. L[j] is analogous to a 
Wilson loop operator while Y[j] creates a Dirac string. 

Let us consider the space of states which are annihilated by the Hamil- 
tonian; this is the Hilbert space of the K — > 00 limit. When restricted to 
states within this Hilbert space, Lpy] and Y[a] satisfy the operator algebra 



L[ 1 ]Y[a] = {-l) I ^Y[a]L[ 1 ] 
[m,L[ a ]} = [Y[ 7 ],Y[a}]=0 



(22.13) 
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Now, it is clear that such an operator algebra can be represented on a vector 
space of the form derived in the previous section: 



The notable difference is that the allowed states must now satisfy the con- 
straints 



Again, a is obtained from a by performing the surgery operation )(— ► ^ on 
any part of a. 

If a is contractible, then Y[a] commutes with all other operators in the 
theory, so its effect on any wavefunction should be multiplication by a scalar. 
If we take this scalar to be 1, then we have the first constraint above. The 
second constraint is necessary in order to realize the operator algebra (22.14) 
and is also required by consistency with the first. As a result of the second 
line of (22.15), we can resolve crossings of a and 7 in either way since they 
are equivalent in the low-energy Hilbert space. 

22.4 Other P, T-Invariant Topological Phases 

It is clear that the structure which we have described above is rather gen- 
eral. Any system whose intermediate-scale degrees of freedom are fluctuat- 
ing loops can give rise to such a phase. This includes, for instance, domain 
walls between Ising spins. If the Ising spins lie on the vertices of a triangular 
lattice, the domain walls will lie on the honeycomb lattice, as depicted in 
figure 22.5. 

Suppose we wish to generalize this structure. We can modify the rela- 
tions and/or change the degrees of freedom to e.g. directed or colored loops. 
Consider the first approach. The relation ty[a] = ty[da] where da is any 
continuous deformation of the multi-curve a must presumably be satisfied 
by in topological phase. However, the other two relations can be modified. 
Suppose iff [a] = *[aU O] is modified to *[a] = d*[a U O]- Tnen tlie 
surgery relation must be modified as well since the surgery relation of figure 
22.3 is in conflict with d 7^ 1, as may be seen in figure 22.6. Thus, for any 
d 7^ 1, a new surgery relation which is consistent with it must be found. It 
turns out that for most d, such a relation cannot be found. Only for the 



£[7] *[{«}] = ("1) /(7 ' Q) *[{«}] 
y[ 7 ]*[{a}] = *[{aU 7 }] 



(22.14) 



^W]=*[{«uO}] 

*[{«}] = *[{a}] 



(22.15) 



392HAPTER 22. FRONTIERS IN ELECTRON FRACTION ALIZATION 




Figure 22.5: A set of basis states of a system of Ising spins on a triangu- 
lar lattice is equivalent to the possible loop configurations of a honeycomb 
lattice. 

y\ — i = ¥r-£2-i = X ^'\-^L^ = d^r — i 

Figure 22.6: If d ^ 1, the surgery relation of figure 22.3 cannot hold since 
the above contradiction would follow. 

special values 

A ( * 

a = cos 

\k + 2 

do such relations exist. The k = 1 case is the d = 1 phase which we discussed 
above. The next phase is the k = 2 phase, which has d = y/2 and the surgery 
relation The surgery relations become more complex with increasing k. 

+ n%] + vi i = o 

Figure 22.7: The surgery relation for d = y/2. 



(22.16) 
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22.5 Non-Abelian Statistics 

The basic feature of these generalizations is that their quasiparticles exhibit 
non-Abelian braiding statistics. This is a possibility which we neglected in 
our discussion of exotic statistics in chapter ??. Suppose we have g degen- 
erate states, ip a , a = 1, 2, ... ,g of particles at x±,X2, ■ ■ ■ , x n . Exchanging 
particles 1 and 2 might do more than just change the phase of the wave- 
function. It might rotate it into a different one in the space spanned by the 
ip a s, so that: 

4, a ^MlU b (22.17) 
On the other hand, exchanging particles 2 and 3 leads to: 

iia^Mll^ (22.18) 

If Mil and M lb do not not commute, M^M^ / M^M^, the particles 
obey non-Abelian braiding statistics. 

To see how such a phenomenon might occur, consider one of the topo- 
logical phases mentioned at the end of the previous section. Because such 
a phase will not have a surgery relation on two strands, there are multiple 
linearly-independent states with four particles, as may be seen in figure 22.8 



o o o o 

Figure 22.8: These two states are linearly independent, except for k = 1. 
In the k = 1 case, the surgery relation of figure 22.3 implies that these two 
states are the same. 

Thus, we have satisfied the first condition necessary for non-Abelian 
statistics: a degenerate set of states of quasiparticles at fixed positions. 
We further observe that braiding particles rotates states in this degenerate 
subspace into each other. For instance, taking particle 2 around 3 and 4 
transforms the first state in figure 22.8 into the second. The transformations 
enacted by braiding operations are such that the order is important, as may 
be seen in figure 22.9. 

The number of states with n-quasiparticles grows very rapidly, in fact 
exponentially ~ d n in the states with d = 2 cos alluded to in the previous 
section. 
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Figure 22.9: By switching the order of two exchanges, we obtain a different 
state. Starting from the state in the upper left, we can exchange quasipar- 
ticles 3 and 4 first and then 2 and 3 (depicted on the right). Alternatively, 
we can exchange 2 and 3 first and then 3 and 4 (depicted on the left). 

Non-Abelian braiding statistics can also occur in the quantum Hall 
regime. The likeliest candidate for such a state is the quantized Hall plateau 
observed at v = |. From the perspective of our earlier discussion of the hi- 
erarchy of Abelian states, this plateau is strange because v = 2 + \ has a 
fractional part which does not have an odd denominator. Thus, it cannot 
arise in the hierarchy. Thus, we must consider states outside of the Abelian 
hierarchy, such as the 'Pfaffian state', which we discuss below. There is 
some numerical evidence that this particular state is a good description of 
the ground state at v = |. 

The Pfaffian (ground) state takes the form 




(22.19) 



In this equation the last factor is the Pfaffian: one chooses a specific ordering 
zi, Z2, ... of the electrons, chooses a pairing, takes the product of the indicated 
factor for all pairs in the chosen pairing, and finally takes the sum over all 
pairings, with the overall sign determined by the evenness or oddness of the 
order in which the zs appear. The result is a totally antisymmetric function. 
For example for four electrons the Pfaffian takes the form 

11 11 11 
+ + . 

Z\ ~ Z 2 Z 3 - Zi Z\ - Z 3 Z4- Z 2 Z\ - z 4 z 2 - z 3 

When the wavefunction is expanded in this way, in each term the elec- 
trons are grouped in pairs. Indeed, the Pfaffian state is reminiscent of the 
real-space form of the BCS pairing wavefunction; it is the quantum Hall 
incarnation of a p-wave superconducting state. 



22.5. NON-ABELIAN STATISTICS 



395 



This wavefunction may be considered as a variational ansatz for electrons 
at filling fraction v = \ in the first excited Landau level (with both spins 
of the lowest Landau level filled) interacting through Coulomb interactions. 
This is the approach used in determining the relevance of this wavefunction 
to experiments, but for a theoretical study of quasiparticle statistics, it 
is more useful to consider this wavefunction (and quasihole excitations in 
it) as the exact zero-energy states of the three-body Hamiltonian below. 
Working with this Hamiltonian has the great advantage of making the entire 
discussion quite explicit and tractable. 

H = E E E - z j) s '& - ^) ( 22 - 21 ) 

This Hamiltonian annihilates wavefunctions for which every triplet of elec- 
trons i, j, k satisfies the condition that if i and j have relative angular 
momentum 1 then i and k must have relative angular momentum > 2. The 
Pfaffian state (22.19) satisfies this condition since i and j have relative an- 
gular momentum 1 only when they are paired, but if i is paired with j, then 
it cannot be paired with k. Since the distance between particles is propor- 
tional to their relative angular momentum, this roughly translates into the 
following: by pairing up and and getting near particle j, particle % is able 
to stay further away from all of the other particles, thereby minimizing its 
interaction energy. 

As in a superconductor, there are half-flux quantum excitations. The 
state 

n(*;-**) 2 n eH * |a/4 - pf ^- m){zk - m) ^- m){zk - m) ) 

. ,, ■ Zj %k 

]<k j •> 

(22.22) 

has half- flux quantum quasiholes at rji and 7/2 • These excitations have charge 
e/4. 

One includes 2n quasiholes at points rj a by modifying the Pfaffian in the 
manner 

pf (_!_) ^ P f ~ ~ W)-{zk ~ rjpKzk ~ r]a)... + (j <-> k) 

Zj Zfa Zj Zfc 

(22.23) 

In this expression, the 2n quasiholes have been divided into two groups of 
n each (i. e. here a, (3, ... and p, a, ...), such that the quasiholes within each 
group always act on the same electron coordinates within an electron pair. 
There are apparently 

(2")! 

— — r— 7 (22.24 
2n!n! v ' 



*2qh 
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ways of making such a division; the factor 1/2 arising from the possibility to 
swap the two groups of n as wholes. In fact, not all of these wavefunctions 
are linearly independent: the true dimension of this space of wave functions 
is actually 2™ _1 . 

Consider first the case of four quasiholes. The basic identity that has to 
be taken into account is, in its most primitive form, 

(Zi - 77l)(zi - 772)0*2 - 7?3)(^2 - m) - Ol - 771)0*1 - 7/3)02 - 774)(Z2 - 7/2) + Ol <~> 

= Ol - z 2 ) 2 {m - 774X772 - 773) • (22 

It will be convenient to abbreviate the left-hand side to (12) (34) — (13) (24). 
Then we have as an immediate consequence of (22.25) the relation 

(12)(34) - (13)(24) (771-774)02-173) 

(12)(34) - (14)(23) "" (t?! -773)02 -774) ' 1 j 

It is interesting that on the right-hand side the basic projective invariant 
of four complex numbers, the cross-ratio, appears. For present purposes, 
however, the important point simply that it is independent of the zs. An 
immediate consequence is that for two electrons and four quasiholes the 
three apparently different ways of constructing quasihole states are reduced 
to two through the relation 

(12)(34)0 1 -77 2 )03-774)+(13)(42)Oi-773)(774-772)+(14)(23)Oi-774)02-773) = . 

(22.27) 

Now we want to argue that (22.26) and (22.27) still hold good for any 
even number of electrons, N e . To see this we insert (22.25) into the Pfafhan 
of (22.23): 

(13)(24) (13)(24) (13)(24) 

Fr (13)(24) = A { • • -J 

Z 1 - Z 2 Z 3 - Z4 Z 5 - Zq 

= A (12X34^-^-^14^3 (12)(34)- 3 -^4) 2 7/147/23 _ ^ ^ ^ 

Z\ - z 2 z 3 - Z4 

where rjij = rji — rjj and A denotes the instruction to antisymmetrize on the 
zs. If we expand, 

(12X34) - Qi - z 2 ) 2 mm3 (12)(34) - Qg - z 4 ) 2 ri u ri 23 

Zl - Z 2 Z 3 - Z4 

= ^ ( MM ( 12 )( 34 ) ...) 

2:1-2:2 23-^4 

Z 3 - Z4 
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+ A ((21 - z 2 )r] 1A r, 23 x (z 3 - z 4 )7] U r] 23 x ii^KMl ...) + . (22.29) 

Z5 — Z% 

there will be terms on the right hand side of (22.29) with zero, one, two, 
. . . , N e factors of ( Zj — Zj ). Upon antisymmetrization, however, a term 
with k factors of (zj — Zj) would have to antisymmetrize 2k variables with 
a polynomial that is linear in each. Since this is impossible for k > 1, such 
terms vanish. Hence 

^ (12) (34) - (gi - z 2 ) 2 7 ll4 7 l23 (12)(34) - (z 3 - gjjWto , 
Zl - z 2 23-^4 
= i( (12P ( 12 )( 34 ) 

Zl - z 2 z 3 - z 4 

-^((z 1 -z 2 )7 ?14 r ?23 ...)• (22.30) 

z 3 - z 4 

Similarly, one has 

(12)(34) (12)(34) . 



Pf(14)(23) ="4(- 



Zl - Z 2 Z 3 - Z 4 

+ ^(^i-Z2)r?i 3 r?24 ®^ •••)• (22.31) 

z 3 — z 4 



From these we deduce the many-electron generalization of (22.25): 

P f (12)(34) - Pf(14)(23) = ( Pf (12)(34) _ P f (13)(24)) • (22.32) 

This is a linear relation among the three pairing possibilities for two quasi- 
holes. It depends on their coordinates but - remarkably - takes the same 
form for any number of electrons. 

Thus, we have shown that there are two four-quasihole states. In the 
same way, it can be shown that there are 2 n_1 2n-quasihole states. Braiding 
operations cause these states to be rotated into linear combinations of each 
other, as further analysis shows []. 
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Excitations in Dirty Systems 
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CHAPTER 23 



Impurities in Solids 



23.1 Impurity States 

In the previous parts of this book, we have discussed the low-energy excita- 
tions which result from broken symmetry, criticality, or fractionalization. In 
this final part of the book, we discuss the low-energy excitations which result 
from the presence of 'dirt' or 'disorder' in a solid. By 'dirt' or 'disorder', 
we mean impurities which are frozen into the solid in some random way. 
Consider phosphorous impurities in silicon. Presumably, the true ground 
state of such a mixture is one in which the phosphorus atoms form a super- 
lattice within the silicon lattice. However, this equilibrium is never reached 
when the alloy is made: it is cooled down before equilibrium is reached, 
and the phosphorus impurities get stuck (at least on time scales which are 
relevant for experiments) at random positions. These random static spatial 
fluctuations have interesting effects on the electronic states of the system: 
they can engender low-lying excitations and they can dramatically change 
the nature of such excitations. To see the significance of this, recall that, 
in the absence of a broken continuous symmetry, a system will generically 
form a gap between the ground state and all excited states. By tuning to 
a critical state - such as a Fermi liquid - we can arrange for a system to 
have low-energy excitations. However, in the presence of disorder, it will 
have low-energy excitations even without any tuning. To get a sense of why 
this should be so, suppose that, in the absence of disorder, there were a gap 
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to the creation of a quasiparticle. In the presence of disorder, the potential 
varies from place to place, and, in the thermodynamic limit, there will be 
some place in the system where the potential energy offsets the energy re- 
quired to create a qasiparticle. In this region, it won't cost any energy to 
create a quasiparticle. Note that such an excitation, though low in energy, 
may not have large spatial extent. By the same token, if the system were 
gapless in the absence of disorder, as in the case of a Fermi liquid, then 
disorder can extend the critical state into a stable phase and it can cause 
the low-lying exciations of the system to become spatially localized. 

Consider the simple example of a single phosphorus impurity in silicon. 
The phosphorus takes the place of one of the silicon atoms. Without the 
phosphorus, silicon is an insulator with a band gap E g . Phosphorus has an 
extra electron compared to silicon and an extra positive charge in its nu- 
cleus. Let us neglect electron-electron interactions and write Schrodinger's 
equation for the extra electron in the form: 

(^lattice + ^impurity) ^(r) = E ^(r) (23.1) 

where -lattice is the Hamiltonian of a perfect lattice of silicon ions and 
-f^impurity is the potential due to the extra proton at the phosphorus site. 
Let us write i/)(r) in the form 

V(r)= X (r)n ko (r) e ik »- r (23.2) 

where Uk ( r ) e* k °' r is the eigenstate of lattice & t the conduction band mini- 
mum. Then x( r ) satisfies the equation: 

where m* is the effective mass in the conduction band, E^ is measured from 
the bottom of the conduction band, and e is the dielectric constant of silicon. 

Hence, so long as we can neglect electron-electron interactions, the elec- 
tron will be trapped in a bound state at the impurity. If the binding energy 

2 

is much less than the band gap, ^ E g , then our neglect of electron- 
electron interactions is justified because the interaction will be too weak to 
excite electrons across the gap. (Note that as is the effective Bohr radius 
in silicon, as = -^hz ~ 20 A). Hence, in the presence of impurities, there 
are states within the band gap. If there is a random distribution of phos- 
phorus impurities, we expect a distribution of bound state energies so that 
the gap will be filled in. In other words, there will generically be states at 
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the chemical potential (although there will be a small density of states when 
the impurity density is small), unlike in a pure system, where it is possible 
for the chemical potential to lie within an energy gap. 

23.2 Localization 
23.2.1 Anderson Model 

What is the nature of these electronic states when there are several impu- 
rities? Presumably, there is some mixing between the states at different 
impurities. One can imagine that this mixing leads to the formation of 
states which are a superposition of many impurity states so that they ex- 
tend across the system. As we will see, this naive expectation is not always 
correct. Consider, first, the case of a high density of electrons and impurities. 
One would expect the kinetic energy to increase with the density, n, as n 2 ^ d 
while the potential energy should increase as n 1 ^, so that the kinetic energy 
should dominate for large n. When the kinetic energy dominates, we might 
expect some kind of one-electron band theory to be valid. This would lead 
us to expect the system to be metallic. How about the low-density limit? In 
the case of a single impurity, the electron is trapped in a hydrogenic bound 
state, as we saw in the previous section. What happens when there is a 
small, finite density of impurities? One might expect exponentially small, 
but finite overlaps between the hydrogenic bound states so that a metal with 
very small bandwidth forms. This is not the case in the low-density limit, 
as we will see in this section. 

Consider the Anderson model: 



In this model, we ignore electron-electron interactions. Spin is an inessential 
complication, so we take the electrons to be spinless. is the energy of an 
electron at impurity i, and tij is the hopping matrix element from impurity 
i to impurity j. These are random variables which are determined by the 
positions of the various impurities. 

One could imagine such a model arising as an effective description of 
the many-impurity problem. Since the impurities are located at random 
positions, there tunneling matrix elements between their respective bound 
states will be random tij. One might also suppose that their different lo- 
cations will lead to different effective environments and therefore different 
bound state energies e^. Anderson simplified the problem by arranging the 




(23.4) 
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sites i on a lattice and setting tij = t for nearest neighbor sites and zero 
otherwise. The effect of the randomness is encapsulated in the e^'s, which 
are taken to be independent random variables which are equally likely to 
take any value q G [—W/2,W/2]. This is a drastic simplification, but it 
already contains rich physics, as we will see. 

For W = 0, there is no randomness, so the electronic states are simply 
Bloch waves. The system is metallic. For W <C t, the Bloch waves are 
weakly scattered by the random potential. Now consider the opposite limit. 
For t = 0, all of the eigenstates are localized at individual sites. In other 
words, the eigenstates are \i), with eigenenergies e,. The system is insulating. 
Where is the transition between the t/W = oo and the t/W = limits? Is 
it at finite t/W = oo? The answer is yes, and, as a result, for t/W small, 
the electronic states are localized, and the system is insulating. (We say 
that a single-particle state is localized if it falls off as e~ r ^. £ is called the 
localization length) 

To see why this is true, consider perturbation theory in t/W. The per- 
turbed eigenstates will, to lowest order, be of the form: 



Perturbation theory will be valid so long as the second term is small. Since 
the Cj's are random, one can only make probabilistic statements. The typ- 
ical value of €i — €j is W/2. The typical smallest value for any given i is 
W/2z, where z is the coordination number of the lattice. Hence, we expect 
corrections to be small - and, hence, perturbation theory to be valid - if 
2tz/W < 1. On the other hand, this is not a foolproof argument because 
there is always some probability for a — €j small. Nevertheless, it can be 
shown (Anderson; Frohlich and Spencer) that perturbation theory converges 
with a probability which approaches 1 in the thermodynamic limit. Hence, 
there is a regime at low density (small t) where the electronic states are 
localized and the system is insulating, by which we mean that the DC con- 
ductivity vanishes at T = 0. From our discussion of the hydrogenic bound 
state of an electron at an impurity in a semiconductor, it is not surprising 
that there are localized states in a disordered system. What is surprising 
is that if the disorder strength is sufficiently strong, then all states will be 
localized, i.e. the entire band will consist of localized states. 

If the disorder strength is weaker than this limit, 2tz/W 3> 1, then 
we can do perturbation theory in the random potential. In perturbation 
theory, the states will be Bloch waves, weakly-scattered by impurities as 




■3 



(23.5) 
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we discuss in the next section. Such states are called extended states. The 
perturbative analysis is correct for states in the center of the band. Near 
the band edges, however, the perturbative analysis breaks down and states 
are localized: the correct measure of the electron kinetic energy, against 
which the disorder strength should be measured, is the energy relative to 
the band edge. Thus, for weak disorder, there are extended states near the 
band center and localized states near the band edges. When it lies in the 
region of extended states, it is metallic. 

When the chemical potential lies in the region of localized states, the 
system is insulating. To see this, we write the DC conductance, g, of s 
system of non-interacting fermions of size L in the following form: 

m = H (23.6) 

5E is the disorder-avergaged energy change of eigenstates at Ef when the 
boundary conditions are changed from periodic to antiperiodic. AE is the 
mean level spacing of eigenstates. (In order to derive such a relation an- 
alytically, one must consider disorder-averaged quantities.) If the states 
at Ef are localized with localization length £, then 5E ~ e~ L ^, and the 
conductance will vanish in the thermodynamic limit. 

This form for the conductance can be motivated as follows. The con- 
ductance is the response of the current to a change in the electromagnetic 
potential (which can be applied entirely at the boundary). However, the 
potential couples to the phase of the wavefunction, so the conductance can 
be viewed as the sensitivity of the wavefunction to changes of the boundary 
conditions. A systematic derivation proceeds from the Kubo formula. 



23.2.2 Lifschitz Tails 

As noted above, the states near the band edge are localized. The asymptotic 
density of such states can be obtained by the following argument due to 
Lifschitz. If the on-site disorder lies in the range [—W/2,W/2] and the 
hopping parameter is t then there are no states below the band edge, E m [ n = 
—2dt — W/2 on the d-dimensional hypercubic lattice. Consider a state near 
E m - m which is spread over N sites. Then each of these sites has on-site 
energy in the range [—W/2, — W/2 + I]. The probability of having such a 
configuration of neighboring sites is ~ (I /W) N . The kinetic energy of such 
a state is SE ~ t N~ 2 / d , so a state which is at energy E m i n + SE is spread 
over N sites with N ~ {5E/t)~ d / 2 . Hence, 

N(E miQ + SE) ~ e" c (*/^) d/2 (23.7) 
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where c is a constant. 

Thus, there isn't a van Hove singularity at the band edge at which ^ 
diverges, but rather an essential singularity at which ^ vanishes. 

23.2.3 Anderson Insulators vs. Mott Insulators 

The preceding analysis shows that a disorder-induced insulating state of 
non- interacting fermions is stable over a finite range of hopping strengths. 
What about the effect of electron-electron interactions? At high density, we 
expect the electrons to screen the impurity potentials, so that they are ac- 
tually of the form — e~ Ar . In the high-density limit, A — > oo, we expect the 
impurities to be extremely well-screened so that there won't be any impurity 
bound states. Hence, electron-electron interactions should, naively, enhance 
metallic behavior at high density. At low density, A — > 0, so the impurities 
are not effectively screened, and we expect the preceeding analysis to hold. 
To summarize, electron-electron interactions tend to stabilize the metallic 
state by screening the random potential due to impurities. However, when 
the density is sufficiently low, this screening effect is too weak to prevent lo- 
calization. As we will discuss in a later section, unscreened electron-electron 
Coulomb interactions do strongly influence the properties of the insulating 
state. 

Of course, electron-electron interactions can do other things besides 
screen the impurities. They can lead to ordered states such as superconduc- 
tivity or ferromagnetism, which would, of course, require a re-examination 
of the above analysis. In fact, sufficiently strong interactions can also cause 
insulating behavior even in an extremely clean system. At low densities 
where Coulomb interactions dominate over the kinetic energy, electrons will 
form a Wigner crystal. A tiny amount of disorder will pin this crystal, 
thereby making it insulating. Hence, it would be more accurate to say that 
electron-electron interactions enhance metallic behavior in the high-density 
limit but cause insulating behavior in the low-density limit. 

The term 'Anderson insulator' is usually used for an insulator which 
would be metallic in the absence of impurities. The implications is that it 
can be more or less continuously deformed into a non-interacting fermion 
system which is insulating as a result of Anderson localization. A 'Mott 
insulator' is an insulator which would be metallic in the absence of electron- 
electron interactions. The Wigner crystal is an example of a Mott insulator. 
Another canonical example is given by the Hubbard model: 

H = - tc \a c 3<> + h.c.E^n iT n iT (23.8) 

<i,j> i 
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For U = 0, it would be metallic at half-filling. However, for sufficiently large 
U there is precisely one electron. Because the energy cost for having two 
electrons on any site is prohibitive, the system is insulating. 

Doped semiconductors and many other real materials do not fall so 
neatly into one class or the other, however. While they might be metal- 
lic in the absence of both disorder and electron-electron interactions, their 
insulating behavior cannot be entirely ascribed to one or the other. Let us re- 
consider, momentarily, the formation of hydrogenic bound states in a doped 
semiconductor. If we were to neglect localization, we would still expect a 
sharp crossover between highly-conducting and poorly-conducting regimes 
at, respectively, high and low densities. The screened Coulomb interaction, 
V(r) ~ ^ e _Ar has no bound states in the high-density limit, where A is 
large. Therefore, as the dopant density is decreased, bound states will begin 
to appear when A falls below a certain value ~ as- This would lead to 
small conductivity. (But not zero yet because the exponentially-small over- 
lap between these bound states would still be expected to lead to metallic 
behavior, albeit with a very narrow band, were it not for localization.) An- 
derson localization of the impurity bound states will make the crossover 
from a metal to a poor metal into a sharp metal-insulator transition. Thus, 
it is more accurate to say that insulating behavior is due to the interplay 
between localization and interactions. 



23.3 Physics of the Insulating State 

At T > 0, any system will have finite conductivity since there will always 
be some probability that thermally-excited carriers can transport charge 
across the system. The conductivity of a finite-sized system will also always 
be non-vanishing. For instance, consider a non-interacting Fermi system 
whose single-particle states at the Fermi energy are localized with localiza- 
tion length £. Then the conductivity of the system will be a ~ L 2 ~ d e _L ^ 
in a system of size L. 

Thus, we define an insulator as any state in which the DC conductivity 
vanishes at T = in the thermodynamic limit. In this section, we discuss 
the properties of a disorder-driven insulating state. 

When a solid is driven into an insulating phase as a result of impurities, 
it is as far from a perfect crystalline lattice as possible, as far as its elec- 
tronic properties are concerned. Not only is translational symmetry absent, 
but physical properties are dominated by localized excitations so that it is 
difficult to construct even an averaged description which is translationally 
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invariant. However, some of these properties - the DC and AC conduc- 
tivities, magnetic response, and specific heat - can be deduced by simple 
arguments, which we present in this section. 

23.3.1 Variable Range Hopping 

One of the most celebrated results in the study of disordered insulators is 
Mott's formula for the DC conductivity due to variable range hopping. Let 
us consider an insulator at low temperatures. We imagine that the electrons 
are all in single-particle localized states. However, at finite-temperature, 
an electron can be excited by, say, a phonon so that it hops to another 
nearby localized state. Through a sequence of such hops, it can conduct 
electricity. One might imagine that electrons will only hop to the nearest 
neighboring states since the matrix element to hop to a more distant state 
will be exponentially (in distance) smaller. However, a state which is further 
away might be closer in energy, and therefore exponentially (in temperature) 
easier to reach. The competition between these competing effects leads to 
variable range hopping. 

Suppose that an electron which is in a localized electronic state ip hops 
to another one a distance R away. The number of available states per unit 
energy within such a distance is ^irR^Np, where Np is the density of states 
at the Fermi energy. Hence, the typical smallest energy difference between 
the state tp and a state within a distance R is the inverse of this: 

, , _ / typical smallest energy difference between \ 3 

\ states separated by a distance R J AttNfR? 

(23.9) 

The conductivity associated with such a hopping process depends on the 
matrix element (squared) between the two states, which we take to be ex- 
ponentially decaying ~ e~ 2aR (where R = 3i?/4 is the average hopping 
distance) and on the probability that the electron receives the activation 
energy needed to hop, which follows the Boltzmann distribution ~ e -P w ( R ) 
Hence, the conductivity due to such a hopping process varies as 

a R ~ e -2«R-PW{R) (23 1Q) 

The prefactor of the exponential contains various comparatively weak de- 
pendences on R; for instance, there should be a factor of R corresponding to 
the fact that a longer hop constitutes a larger contribution to the current. 

There are many such hopping processes available to our electron in state 
ip. The electron could hop a short distance with relatively large matrix 
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element, but the activation energy would probably be high in such a case. 
It could instead hop a long distance and find a state with low activation 
energy, but the matrix element would be very small. The optimal route 
is for the electron to hop a distance which minimizes the exponential in 
(23.10): 

-2a-/?^ = (23.11) 

or 

/ q x 1/4 
Ropt = ^ tt= (23.12) 



opt \2itaN F T 

Hence, as the temperature is decreased, the electron makes longer and longer 
hops. The resulting temperature dependence of the conductivity is: 

a(T) = a (T)e^ To ^ 1/4 (23.13) 

where <to(T) is relatively weakly-dependent on T, e.g. powerdaw. In d 
dimensions, the preceeding analysis immediately generalizes to 

<j(T)=o Q {T)e- (T °/ T ^ (23.14) 



23.3.2 AC Conductivity 

There are many situations in which the to S> ksT frequency dependence of 
a quantity can be obtained from its DC temperature dependence by simply 
replacing k^T with to. The conductivity of an insulator does not fall within 
this class. The AC conductivity is entirely different in form. 

At frequency u, an electron will oscillate between states separated in 
energy by u. Let us think about these resonant states in terms of hydrogenic 
bound states. There are two possibilities. (1) R is large, so that they are 
essentially individual, independent bound states separated in energy by the 
difference between their respective potential wells. Then the dipole matrix 
element is ~ Re~ aR ; the current matrix element is ~ ujRe~ aR . Since the 
matrix element will be exponentially suppressed, the contribution of such 
resonant pairs will be small, and we turn to the second possibility. (2) The 
distance R is not very large, so that the energy eigenstates are actually linear 
combinations of the hydrogenic bound states. Then the energy difference 
between the eigenstates is essentially the matrix element of the Hamiltonian 
between the hydrogenic bound states ~ Ioe~ aR . The dipole matrix element 
is ~ R; the current matrix element is ~ luR. There is no exponential 
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suppression because the two eigenstates are different linear combinations of 
the same two bound states. 

Then the conductivity due to this resonant pair is: 

a(uj) ~ (dipole matrix element) 2 • (available phase space) (23.15) 

The available phase space is the density of states per unit volume multiplied 
by the volume in which the states can lie, given that they are a distance R 
apart: ~ Np R 2 AR. AR is the size £ of a localized state. Hence, 

a{u) ~ {uRf R 2 ~ to 2 R A (23.16) 

However, by the arguments of the preceding paragraph, 

ln(7 /w) 

Thus, we have the following formula for the conductivity: 

ln 4 (/ /w) (23.18) 

This formula also holds at finite temperatures, so long as w > T. How- 
ever, when the frequency is decreased, the conductivity will cross over to 
the variable-range hopping form. 

23.3.3 Effect of Coulomb Interactions 

In an insulator, Coulomb interactions are not screened at long-wavelengths 
and low-frequencies because there aren't mobile charges. Thus, their effects 
are particularly strong. One effect, pointed out by Efros and Shklovskii is a 
suppression of the single-particle density-of-states at the Fermi energy. The 
basic physics is as follows. Suppose that there were two (strongly-localized) 
single-particle states, one just below the Fermi energy and one just above. 
The 'single-particle energy cost' associated with promoting an electron from 
the lower one to the higher one would be at least partially offset by the 
negative Coulomb energy associated with the resulting particle- hole pair. If 
the distance between the states is small, this Coulomb energy will be large 
and will overcompensate the single-particle energy. Hence, the states must 
be far apart and, therefore, the single-particle density of states (per unit 
volume) must be small. 

More quantitatively, suppose the single-particle energy separation be- 
tween an un-occupied state above the Fermi energy and an occupied one 
below the Fermi energy is Ai£ unocc _ occ while the distance between the states 



(23.17) 
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is r. The 'single-particle energies' include the Coulomb interaction energy 
between the two states under consideration and all of the other occupied 
states in the system. Then it must be the case that 

e 2 

A-£/ U nocc— occ (23.19) 

r 

or else the energy would be lowered by transferring an electron from the 
occupied state to the unoccupied one, contradicting our assumption. The 
right-hand-side is the Coulomb energy between the electron and the hole 
which results when such a transfer occurs. 

Hence, the number of states per unit volume which are within energy 
AE of the Fermi surface is 

n(AE) ~ -5g < (AE) 3 (23.20) 

from which we see that the density of states is 

N(AE) ~ (AE) 2 (23.21) 

Thus, the single-particle density of states vanishes at the Fermi energy, and 
is suppressed near it. This effect is called the Coulomb gap. 
Suppose we define energy, length scales A, ta by 




N r 3 A A = 1 (23.22) 

where Nq is the density of states in the absence of long-range Coulomb 
interactions, to which the density of states returns when the distance from 
the Fermi surface exceeds A = e 3 y/l%, as shown in figure 23.1. 

One could measure the single-particle density of states by performing a 
tunneling experiment in which current tunnels into a disordered insulator 
from a metallic lead to which it is contacted. The differential conductance 
dl/dV will be proportional to the density of states. According to the above 
arguments, dl/dV will be suppressed at zero voltage and will increase with 
voltage as V 2 . However, when we change the applied voltage on our disor- 
dered insulator, the Fermi energy moves with it. Shouldn't the differential 
conductance remain zero? The answer is no. As we change the voltage, 
dl/dV increases with voltage as V 2 , as if the chemical potential had not 
moved. However, if we raise the temperature of the insulator, change the 
applied voltage to Vq, and cool it down again, we find that dl/dV varies 
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Figure 23.1: The single-particle density-of-states is suppressed at the Fermi 
energy. For \Ee f \ > A, the single-particle density-of-states returns to Nq. 

as (V — Vo) 2 , so that the Fermi energy has moved to Vo- Essentially, the 
suppression of the density of states remains wherever it was when the sys- 
tem last equilibrated. The non-equilibrium nature of the Coulomb gap is 
related to the assumption above that the only Coulomb interaction energy 
which must be accounted for is that between the excited electron and the 
hole left behind, —e 2 /r. However, when an electron is excited from an oc- 
cupied state to an unoccupied one, all of the other electrons can and will 
rearrange themselves. This will occur in such a way as to allow a state near 
the Fermi surface. Thus, if an experiment were done slowly enough to allow 
such re-equilibration, the suppression calculated above would not occur. 

The suppression of the density-of states near the Fermi energy has con- 
sequences for variable-range hopping. Recall that we estimated that within a 
region of size R, the typical smallest energy separation AE is A E ~ l/(iVo-R 3 ). 
However, N(AE) ~ (AE) 2 , or AE ~ 1/R, as expected if the energy is de- 
termined by Coulomb interactions. Following the minimization procedure 
of subsection 23.3.1, we find that the variable-range-hopping formula in the 
presence of Coulomb interactions (in arbitrary dimension) is: 



23.3.4 Magnetic Properties 

In the absence of electron-electron interactions, there will be two electrons 
in each localized single-particle state, and the ground state will be para- 



a ~ e 



(T /T)V2 



(23.23) 
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magnetic. However, it is clearly unrealistic to ignore electron-electron in- 
teractions. We expect them to prevent two electrons from occupying the 
same state. Thus, we must consider the spin-spin interactions between the 
electrons in different localized states. 

An isolated impurity state, occupied by a single electron will have a 
Curie form for the magnetic susceptibility: 

Xc(T) ~ i (23.24) 

If we have many such spins but ignore the interactions between them, we 
will have a susceptibility of the same form, proportional to the density of 
spins. 

If, however, we include the interactions between spins, the susceptibility 
will have this form at high-temperatures, but once T becomes comparable 
to or less than the typical spin-spin interaction strength J, x(T) will exhibit 
different behavior. There are several possibilities. One, which occurs in, 
for instance, Cu:Mn and Au:Fe alloys is spin glass behavior, in which the 
susceptibility has a cusp at a temperature Tsg, as shown in figure 23.2. The 
non-linear susceptibility actually diverges at Tsg- Spin glasses are rather 
incompletely understood, and many of their properties are still the subject 
of controversy. However, the basic caricature is that the spins order, but 
unlike in a ferromagnet, in which they all point in the same direction, the 
spins point in a random (but fixed) direction which varies from point to 
point. At the freezing temperature, Tsg, the system undergoes a transition 
into such a state in which (S(x)) = So(x) where So(x) is a randomly- varying 
function of x. There is some disagreement about whether or not such a 
state has infinitely-many distinct degenerate ground states (over and above 
the degeneracy due to broken rotational symmetry) and the related issue of 
whether or not there is a sharp phase transition in non-zero magnetic field. 
These issues are discussed in (refs.). 

Spin-glass order is believed to require spin-spin interactions of both fer- 
romagnetic and antiferromagnetic signs which are sufficiently disordered as 
to frustrate either of these ordering tendencies of clean systems. As a result, 
the spins order in a novel type of ordered state which can occur only in a 
dirty system. An alternate possibility which can occur in a system which 
has predominantly antiferromagnetic interactions (but of randomly- varying 
strengths) is that the spins do not order but rather form a random singlet 
phase. In such a phase, the system essentially breaks up into a set of pairs 
of spins which interact with strong antiferromagnetic exchange couplings Jj. 
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Figure 23.2: The susceptibility of a spin-glass has a kink at T$q. 

Each of these pairs has susceptibility 

X^AT) = \ j^j- (23.25) 

The susceptibility of the system must be summed over all pairs with some 
probability distribution, P(J), for the values of J: 

^ = hj dJP{J) jT^ (23 - 26) 

At a given temperature, T, there will be some spins whose J is smaller than 
T. Such spins will not have formed singlets yet, so they will give a Curie-like 
contribution to the susceptibility, which will be the dominant contribution: 

Xu„paircd(T) ~ (23.27) 

where p(T) is the number of unpaired spins at temperature T. 

The function p(T) depends on the distribution of exchange couplings, 
P(J). Let us suppose that there is a small density n of randomly-situated 
localized spins, and suppose that the coupling between spins separated by a 
distance r is J(r) = Jo e~ 2r l a for some constant a which is roughly the size 
of the localized states. With this functional form for J, we know P(J) if we 
know P(R) such that P(R)dR is the probability that a spin has its nearest 
neighbor lying in a thin shell between R and R + dR. We can compute 
P(R) in the following way. Define p{R) as the probability that the nearest 
neighbor is a distance R or less. Then 

P(R) = ^ (23.28) 
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The probability that the nearest neighbor is between R and R + dR is given 
by the number of impurities in a thin shell at R, AimR 2 dR, multiplied by 
the probability that there is no closer neighbor, 1 — p(R) 

P(R) = AimR 2 {l - p(R)) (23.29) 

or, simply, 

% = 4TrnR 2 (l-p) (23.30) 
dR 

Solving this differential equation, we have 

p(R) = 1 - e - 47rnR3 P{R) = AimR 2 e - 4 ™« 3 (23.31) 
We substitute this expression into (23.26), 

X(T) = ±JdR AnnR 2 (23 . 3 2) 

In the low-temperature limit, this is simply 

X (T) « 1 J dR 87mR 2 e -t™« 3 e~ pJo ™p(-2RH ( 2 3.33) 

The integrand has a saddle-point at R « | ln(Jo/T), dropping lnlnT terms 
which are subleading at low T. Hence, the integral can be evaluated by 
saddle-point approximation: 

X (T) ~ I e -\™aHv?{.h/T) (2$M) 

The physical interpretation of this result is simple. Unpaired spins at tem- 
perature T are those spins whose largest coupling to the other spins is less 
than T: 

J(R)~T fl~|ln(J /T) (23.35) 

The density of such spins should therefore be: 

p(T) ~ e -s«™ 3 in 3 (.7o/T) 

as we found above. 

The approximate form which we used for P(R) underestimates the num- 
ber of unpaired spins, thereby underestimating the susceptibility. We took 
P(R) to be the probability of having a neighboring spin at R, namely AirnR 2 , 
multiplied by the probability that there were no closer neighbors, 1 — p(R). 
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However, the neighboring spin at R might itself have already paired up with 
a third spin, in which case our spin must look for an even further neighbor 
with which to form a singlet pair. Thus, we should multiply by the prob- 
ability that the neighbor at distance R doesn't have a closer neighbor of 
its own, which is roughly 1 — p(R). (It should really be a fraction of this 
because we should exclude the region which is less than a distance R from 
both spins since this region has already been accounted for by the first factor 
of 1 — p(R)-) Hence, 

P(R) = 4TmR 2 (l - p(R)) 2 (23.37) 

or 

47T71 R 2 

P(R) = ™ , (23.38) 

(l + f7mi?3) 2 

Substituting this into (23.26), we find the low-temperature form 

X (T) « i f dR 87TnR2 2 «P("2*/«) (23.39) 

The saddle-point is again at R « |ln(Jo/T), up to lnlnT terms. Hence, 
the saddle-point approximation for the integral is: 

X(T) 5- (23.40) 

V ' Tln 3 (J /T) V ; 

Hence, the number of unpaired spins, 

p(T) - (23.41) 

PV ' ln 3 (J /T) V ; 

goes to zero very slowly; so slowly, in fact, that the susceptibility diverges 
at T -► 0. 

In ID, the asymptotic low-temperature behavior can be found exactly 
(refs.). 

Exercise: Compute the specific heat at low T in a random singlet phase. 

23.4 Physics of the Metallic State 

23.4.1 Disorder- Averaged Perturbation Theory 

A metallic state in a disordered system can be approached perturbatively 
from the clean metallic state. In order to do so, we will need to handle 
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the perturbation caused by impurity scattering. In this section, we describe 
the simplest way to do so. Let us imagine breaking our system into a large 
number N cells, each of which is macroscopic. Then the configuration of 
impurities in each cell will be different. If we compute an extensive quantity, 
such as the free energy, then it will be essentially (up to boundary terms) 
the sum of the free energies of each of the cells. Since each cell will have 
a different impurity configuration, the sum over cells will be an average 
over impurity configurations. Hence, the free energy in a disordered system 
can be computed by computing the free energy for a fixed realization of the 
disorder and then averaging over all realizations of the disorder (with respect 
to some probability distribution for the disorder). Such an approximation 
(which we take to be exact in the thermodynamic limit) for the free energy is 
called the quenched approximation and such an average is called a quenched 
average. 

Let us consider a perturbative computation of the free energy. For the 
sake of concreteness, suppose that the action is 



Sty, tf] = S c l can ty,^]} + d T d 3 X V{x)lfi {x,t)^{x, t) (23.42) 



Then there is a vertex in the theory of the form depicted in figure 23.3a. 
To each such vertex, we assign a factor of the Fourier transform of the 
potential, V(q) and integrate over q. We can therefore organize the free 
energy in powers of V(q): 



If we think about computing the free energy perturbatively in some 
coupling constant in Sdcan (which includes the simplest case, namely that 
•Scican is a free theory), then the free energy will only contain connected 
diagrams. 

Now suppose that we average over V(q). Let us consider, for simplicity, 
a distribution for V(q) of the form 



where rij is the impurity concentration and v is a measure of the strength of 
the scattering potential due to each impurity. All higher moments are taken 
to be factorizable into products of VV. In other words, we will average 




K(q) = o 

V(q)V(q!) =»y% + q') 



(23.44) 
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Figure 23.3: (a) The impurity vertex for a fixed V(q). (b) The induced 
'interaction' which results from averaging over the propability distribution 
for V(q). 

F [V(q)] over the distribution: 



Integrating over V ties together different impurity vertices with impurity 
lines in all possible ways, according to Wick's theorem. To each such line, 
we attach a factor of niv 2 (5(q + q'), as shown in figure 23.3b. 

In essence, we have a new Feynman rule associated with impurity lines 
connecting impurity vertices. There are two important differences with most 
of the Feynman rules which we have considered thus far. (1) No energy flows 
along these lines, only momentum. (2) The diagrams must be connected 
even if all of the impurity lines are cut. In particular, if the clean theory 
is a non-interacting theory, then the disorder-averaged perturbation theory 
can have no closed electron loops. For example, figure 23.4a is allowed, but 
23.4b is not. 

Correlation functions are obtained from functional derivatives of the free 
energy with respect to sources, so the same rules apply to them. 




(23.45) 



23.4.2 Lifetime, Mean-Free-Path 



The properties of a dirty metal can be derived neatly with the use of an 
effective field theory which we will discuss in the next chapter. Therefore, 
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(a) 



(b) 



Figure 23.4: (a) An allowed diagram in disorder-averaged perturbation the- 
ory. This diagram is the lowest-order contribution to the self-energy, (b) A 
disallowed diargam in disorder-averaged perturbation theory. 

we will just summarize these properties here and describe their qualitative 
features. 

Many of the properties of a dirty metal are qualitatively the same as 
those of a clean metal: the compressibility is finite and approximately 
temperature-independent at low-temperatures, as is the magnetic suscep- 
tibility. The specific heat is linear in T. The principal new feature in a dirty 
metal is that the DC conductivity does not diverge at low-temperatures; it 
approaches a constant value at T = 0. 

This can be seen with a perturbative calculation. Consider the electron 
Green function. The lowest-order contribution to the self-energy comes from 
the diagram depicted in figure 23.4a. We will call the imaginary part of 
this diagram (i/2r) sgn(e n ), for reasons which will be clear shortly. If the 
interaction between the electrons and the impurities is 



tfimp = / (03 ^ ^ V ( k ' ~ k ) ^ £ ) e) (23-46) 



where, for simplicity, we take 



V{q) = 



F(q) V(q!) = n t v 



<V 3 )(q + q') 



(23.47) 



The disorder-averaged value of this diagram is equal to 
— sgn(e n ) = ImS(e,fe) 
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d 3 k' V(k - k') V(k' - k) 



(2vr) 3 ie n - £ k j 

= 2vriV> n iV 2 sgn(e n ) (23.48) 

The real part merely renormalizes the chemical potential. 

If we add this self-energy to the inverse Green function, we have a Mat- 
subara Green function of the form: 

<?(k, en) = — — t — (23.49) 

«e„ - 6 + 27Sgn(e n ) 

The corresponding retarded and advanced Green functions are: 

GW,adv(k, e) = -i— - (23.50) 

e - Cfc ± 27 

In the presence of impurities, momentum eignenstates are no longer en- 
ergy eigenstates. The inverse-lifetime 1/r is the energy-uncertainty of a 
momentum eigenstate. Writing = vp\k — kp\, we see that the mean-free- 
path £ = vft is the momentum uncertainty of an energy eigenstate. It is 
essentially the average distance between scatterings of an electron since the 
momentum is the inverse of the spatial rate of change of the wavefunction 
and the latter is scrambled at collisions. 

Fourier transforming the fermion Green function at the Fermi energy 
into real space, we find 

G rct (x,e) ~ e -M/* (23.51) 

Thus, the single-particle Green function decays exponentially, unlike in a 
clean system, where it has power-law behavior. 



23.4.3 Conductivity 

With this single-particle Green function, we can perturbatively compute the 
conductivity. Consider the basic 'bubble' diagram, figure 23.5, at q = but 
with each electron line representing the Green function (23.49) rather than 
the bare Green function. 



(j(0,u n )j(0, -uj n )} = 

\vlN F f di k \Y,-— -■ \ T i • 7—^ ( 23 - 52 ) 
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j(q,0 




j(-q-CO) 



Figure 23.5: The basic conductivity bubble. The electron lines represent 
lines dressed by the one-loop self-energy correction computed in the previous 
section. 



where we have introduced the shorthand r n = rsgn(e„). The factor of 1/3 
comes from the angular average over the Fermi surface. As usual, we convert 
the sum over Matsubara frequencies to an integral 

_ v JL Np [ f^n F (z) : (23.53) 

^'z+m ^'z 

The integrand is non-analytic if z or z + iuj m is real so the integral is equal 
to the contributions from the contour as is passes just above and just below 
these lines. Let us assume that uj n > 0. 

0'(o,w n )j(o,-w n )) = 

-*3lN f df f°° — - ( 1 1 \ 

v 2 F Ar f „ f°° de , . ( 1 1 \ 1 
- -fN F / di k / ^-n F (e) — — : 

(23.54) 



Only the second term in parenthesis in the first line of (23.54) contributes to 
the imaginary part of the integral and only the first term in parenthesis in 
the second line of (23.54) contributes to the imaginary part of the integral 
because the other terms have poles on the same side of the axis. Hence, we 
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have 



0'(0,u; n ) j(0, -u n )} = 

v 2 f f°° de 
-^N F / d£ fc / — n F (e) 



2ni € + iuj n - £ k + ± e - £ k - £ 
-f^/de*/ o-.n F (e) . : r (23.55) 



-oo " £ - 6c + 27 e ~ ik>n - £fc - 27 

Taking iu; n — > w and dividing by a;, we have 
<t(w) = 

1 «2 /* d e 1 1 

- "f ^ / d£ fc / — Me — — 

1 v£ AT f „ f°° de . , 1 1 

^3 J J_oo 2vr e -£ k + — e - u -£ k - — 

(23.56) 

Shifting e — > e + w in the second integral and combining the two terms, we 
have: 



a{u) = 

1 4 » r /" ,, f°° de , 

_ —E-Np J di k J — (n F (e) - n F (e + w)) - 



(23.57) 

Taking w — > and noting that (np(e) — n F (e + w)) /o> — ► 5(e), we have 

= - vpr N F 

= z^kpi (23.58) 

As expected, it is a constant determined by the lifetime due to impurity 
scattering. In the final line, we have written v F N F = kp/ir 2 . (This is the 
3D expression. In general, it is proportional to hp" 1 .) 
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The finite conductivity of electrons reflects the fact that electrons move 
diffusively rather than ballistically. If we define a diffusion constant D = 
| v^pT, then a = NpD, which is the Einstein relation. Further insight into 
this can be gained by computing the density-density correlation function, to 
which we turn in the next section. 

Note that we have not included vertex corrections in our calculation. 
This is not a problem because there are no vertex corrections to the current 
vertex to this order, as may be seen from the Ward identity, which relates 
vertex corrections to the derivative of the self-energy. 

T^P,P,0) = ^^ (23.59) 

As usual, we have used the notation p^ = (cu,pi), i = 1, 2, 3. Since the self- 
energy is momentum-independent, there are no vertex corrections to the 
current vertices T . On the other hand, the self-energy is energy-dependent, 
so there are important vertex corrections to the density vertex, T°. 

Note that we computed this integral by performing the energy integrals 
first and then the momentum integrals. When these integrations do not com- 
mute, e.g. when the integrands are formally divergent by power-counting, 
it is safest to introduce a momentum cutoff. When this is done, the order 
of integrations does not matter. A contour integration over ^ cannot be 
done, however, as a result of the cutoff, so the most natural way to proceed 
is to perform the energy integral first. The resulting momentum integral 
is then convergent in most cases. When the integrals are convergent, it 
doesn't matter in what order they are done. When they do not converge, 
as in the present case, the order does matter. The order in which we did 
the integrals is safest since it can be done with a momentum cutoff. If we 
had ignored this subtlety and done the integrals in the opposite order, and 
done the momentum integral without a cutoff (e.g. by contour integration) 
then we would have actually obtained the same result for the real part of the 
current-current correlation function. However, the imaginary part, which 
includes a term which cancels the diamagnetic term, would be missing. 

23.4.4 Diffusion 

The finite diffusion constant of electrons, D, which we found indirectly in the 
DC conductivity above, can be obtained explicitly by computing the density- 
density correlation function. As we will see, this correlation function has a 
very different form in a disordered electron system. 
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Figure 23.6: (a) A geometric series of diagrams which contribute to the 
density-density correlation function, (b) The quantity T(q, e n , e n +uj m ) which 
is the ladder sum which sits inside the density-density bubble. 



Consider the set of diagrams in figure (23.6)a (with, again, the electron 
Green function dressed with the one- loop self-energy, i.e. the lifetime). All of 
these diagrams contribute to the density-density correlation function. They 
form a geometric series which we can sum once we have obtained the value 
of the first. The higher-order diagrams constitute vertex corrections to the 
first diagram in the series. As noted in the previous subsection, there are 
no vertex corrections to the conductivity (i.e. the corresponding diagrams 
for the current-current correlation function do not correct the vertex in the 
DC limit, which is why we computed only a single diagram). However, such 
vertex corrections are important for the density-density correlation function. 
We will show that the resulting correlation function is consistent with the 
Ward identity. 

The sum of these diagrams is 

(p(q, u) m ) p(-q, -u m )) = ~2J J j^-^T(q,e n ,e n +uJ m )g(k,e n )g(k+CL,e n +u} r , 

(23.60) 

where F(q, e n , e n + oj m ) is the infinite series of ladder diagrams in (23.6)b 
+ u m ) = 1 + riiV 2 I(q, 

+ u; m )) 2 + ... (23.61) 

and the integral I(q, e n , e n + u> m ) is given by 

I(q,e n ,e n + uj m ) = J -^—^ <?(k, e n ) G(k + q, e n + u m ) (23.62) 
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Written explicitly, 

N F 



/ d£ fe d(cos0)- — 



2 7 ie n + iuj m - 6 - «F9 cos 9 + ie n - £ fc + ^ 

(23.63) 

where we have used the shorthand r„ = rsgn(e„). Integrating we have: 
I(q,e n ,e n +u m ) = 

7 iw m - v F qcos9 + - [0(e„ + w m ) - 0(e„)] 

Integrating cos 0, 

I(q,e n ,e n + w m ) = 

"T-N> rfl , \ /» / . A11 fiWm - «F? + ; [9(e„ + W m ) " ^(6n)] \ 

0(e„) - 0(e n + u m )\ In %— — — 

v F q y lLOm+VFq+ l[e(e n +u; m )-6(e n )]J 

iixNp (l + (uj m T + iv F Tq)[6(e n + uj m )-6(e n )] 

[V{£n) - U{e n + L0 m )\ in 



vfi? ' V 1 + Kt - WFTg) [0(e„ + u m ) - 0(e n )L 

(23.65) 

Expanding to lowest non-trivial order in to m and q, we have 

I(q,e n ,€ n + u} m ) = - l -— -\9(e n + uj m ) - 9(e n )] 2 x 
v F q 

2iv F rq - 2iv F rquj m T [9(e n + uj„ l ) - 9{e n )\ + ^(iv F Tqf^J (23.66) 

This expression vanishes unless e n , e n+m have opposite signs, in which case: 

I(q, e n (e n + Wm) < 0) = 2itN f t ^1 - |w m | r - ^v F T 2 q 2 

= 2irN F T(l-\uj m \T-DTq 2 ) (23.67) 

where the diffusion constant -D is given by D = v f t/3. 
Summing the geometric series (23.61), 

1 

T(q,e n ,e n +uj m ) = 



1 - mv 2 I(q,e n ,e n + u r , 
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1 



1 - (1 - \u m \ t - Drq 2 ) [9(e n + u m ) - 9(e n )f 

, n 9 — £n£n+m) + &{^n^n+m) (23.68) 

+ Dq A 



T \0J 



where we have used 2tt NprriiV 2 = 1. The same form is obtained in any 
dimension, with minor differences, such as D = Vpr/d in d dimensions. 
This form indicates that the electron density diffuses. 

To make this a little more concrete, consider the density-density corre- 
lation function (23.60): 



1 r ^3jj. 

(p(q,u> m ) p(-q,-u m )} = J ^^6>(e„e„ +m )£(k,e n )£(k+q,e n -h^ m ) 



/ j^S \ KJTW 0(~e n e n+m ) G(k, e n ) 0(k + q, e n + ^ 

(23.69) 



The first term is essentially the static response of free fermions with r = oo 
up to corrections of order 1/r, LO m : 



1 , f (i 3 k 

J ■^3 6l ( e rten+m)^(k,e n )a(k + q,e n + cj m ) = 



d 3 k 

= —Np + 0(l/r, u; m ) (23.70) 



lv^ /" d 3 k 1 1 ^, , 

/ 7o~TT~ c a~ T +0{1/T,uj m ) 

8^ J (2ttY ie n -^ k -v F q cos 6 ie n -i k 



In order to do this integral correctly, we must integrate frequency first or 
else work with a cutoff in momentum space. Integrating in the opposite 
order will give us an answer which vanishes in the to m = limit, i.e. we will 
miss the static part. By separating the density-density correlation function 
into static and dynamic pieces - the two terms of (23.69) - we are left with 
a dynamic piece which is strongly convergent because the Matusbara sum 
is over a finite range. Hence, we can do the integrals in the opposite order 
in evaluating that term, to which we now turn. 
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The second term can be evaluated by doing a contour integral: 
7j£ / 7^3 \ J^TD^ 0{-e n e n+m ) <7(k, e n ) G(k + q, e n + u m ) = 

P 2 \^m\+Dq 2 iuj m -v F qcose+^[e(e n+m )-9(e n )} 

_27diV>^l i #(-e„e„ +m ) 



/3 2 ^ r |w m | + Dg 2 



fr n2 (23-71) 



Combining the two terms, we have 



Dq 2 

(p(q, co m ) p(-q, -uj m )) = -N F -f— T (23.72) 

This correlation function has a pole at |cj m | = —Dq 2 . In order to continue 
to real time, observe that f(z) = — izsgn (Im(z))) = |u; m | if z = ioj rn . If we 
continue to z = lo + i<5, then we have f(u + id) = —iu, so that the retarded 
density-density correlation function becomes: 

(p(q, u>) p(-q, -u,)) = -N F ( _^ 2 pg2 ) (23.73) 

The existence of a pole at iu = —Dq 2 means that the Fourier transform has 
the form: 

<[p(x, t),p(0, 0)]> 6{t) ~ e"^ 0(t) (23.74) 

Hence, the electron density moves diffusively since a small, highly localized 
wavepacket prepared at t = will spread out over a region of size x ~ 

t l/2 

at time t. 

We obtained a diffusive form for the propagation of electron density by 
summing a particular class of diagrams, but these are clearly not the only 
possible diagrams. However, since the density-density correlation function 
which we obtained vanishes at q = 0, as required by the Ward identity (see 
below for a recapitulation of this fact), our neglect of other diagrams is at 
least consistent with charge conservation. There is a further justification 
for neglecting other diagrams: they are suppressed by a power of X/kpt. 
(Exercise: check this.) However, we will find in the next section that 
such an argument is too quick. 
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With the density-density correlation function (23.73) in hand, we can 
re-visit the conductivity. Charge conservation tells us that 

d t p + V-i = (23.75) 

Fourier transforming this equation, we have the following relation between 
correlation functions: 

Q 2 j(-q, -u>)) = (p(q, u) p(-q, -u)) (23.76) 

(The vanishing of the right-hand-side for q = confirms that our expres- 
sion for the density-density correlation function satisfies the Ward identity.) 
Hence, from the previous expression for the density-density correlation func- 
tion, we have 

-{j{q,u)j{-q-u J ))=N F . ™ D (23.77) 
iuj iuj + Dq z 

Taking q = 0, and then to — > 0, we have the DC conductivity. 

The spin-spin correlation function also has a diffusive form. Consider, 
for instance, the S z — S z correlation function. For non-interacting electrons, 
it is precisely the same as the density-density correlation function because 
S z = Pi — Pi implies that (S Z S Z ) = (p-fp-f) + (p-fPt) = (pp) (^ ne cross-terms 
vanish for non-interacting electrons). By rotational symmetry, the other 
spin-spin correlation functions must be identical. This can also be easily 
seen by direct calculation of (S+S-): the electron line will have spin +1/2 
while the hole line will have spin —1/2, but the diagrams will be unchanged 
from the density-density calculation because the fermion propagators are 
spin- independent : 

(S+iq^S-i-q-u,)) = -N F ( _jy Dg2 ) (23-78) 

In the presence of impurities, electrons no longer move ballistically at 
long length scales. They move diffusively because they undergo many col- 
lisions with impurities. We can think of this in the following RG language. 
Consider the effective Lagrangian for a Fermi liquid in the presence of im- 
purities. 
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To avoid clutter, we suppress the marginal four- Fermi interactions parametrized 
by Landau parameters which are inessential to this discussion. 

For a fixed V(k), this term is precisely marginal. This is what would 
be expected from a single-impurity. Consider, now, a random V(k). When 
we integrate over different impurity configurations according to (23.47) we 
generate the effective action 



with the proviso that we should only allow diagrams which would remain 
connected even if all of the impurity lines (connecting the two pairs of 
fermions in the second line in equation (23.80)) were cut. (We will intro- 
duce a formal way of implementing this in the next chapter.) The resulting 
impurity-scattering term is strongly-relevant, scaling as s _1 . (It is similar 
to a four- Fermi interaction but with one fewer energy integration.) Thus 
the clean Fermi liquid is unstable to impurity-scattering. The RG flow is 
towards the diffusive Fermi liquid. At this fixed point, the density-density 
and spin-spin correlation functions takes the diffusive form (23.73), (23.78) 
while the single-fermion Green function (23.49) is short-ranged as a result 
of the ijr in the denominator. In the next chapter, we will construct an 
effective field theory for the long- wavelength, low frequency modes of the 
diffusive fixed point. 

23.4.5 Weak Localization 

In the previous two subsections, we expended substantial effort to compute 
the electron Green function, and the density-density and spin-spin correla- 
tion functions. However, the result of all of these diagrammatic calculations 
was no more than simple qualitative considerations and naive Drude theory 
would have given us: electrons have a finite conductivity which is determined 
by the lifetime r. This lifetime also enters the diffusive form of various cor- 
relation functions through the diffusion constant D = v 2 F T/d. One hardly 
needs sophisticated field-theoretic techniques to discover this, although it 
is, perhaps, some consolation that we can confirm our intuition by direct 
calculation. 




+ riiV 




d 3 k d 3 k' de d 3 p d 3 p' de' 

(2tt) 3 (2tt) 3 2^ (2^) 3 (2vr) 3 2^ 



^ t (k,6)^(k / ,6)V t (p,e / )V(p^e , ) 

(23.80) 
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Figure 23.7: (a) Diagrams contributing to the conductivity with impurity 
lines maximally-crossed, (b) A re-drawing of the maximally-crossed dia- 
grams which emphasizes the role of the particle-particle diagrams shown in 
(c). 



One might naively think that, in the absence of interactions, this is 
all that there is to the conductivity of a dirty metal. However, the sub- 
leading (in impurity contributions to the conductivity are also inter- 
esting. They are due to quantum interference effects. In two dimensions, 
the 'subleading' correction is divergent at low-temperatures and, in fact, 
non- interacting electrons are always insulating in d = 2. Field-theoretic 
techniques are invaluable in obtaining and understanding these corrections. 

Consider the diagrams of figure 23.7a. In these diagrams, the impurity 
lines are maximally-crossed. This is a particular class of diagrams contribut- 
ing to the conductivity. In this subsection, we will see why these diagrams 
give a significant contribution to the condutivity. Observe that they can 
be re-drawn in the manner shown in figure 23.7b. The diagrams of 23.7b 
are very similar to those of the diffusion propagator. The main difference is 
that the arrows are in the same direction, rather than opposite directions, 
i.e. the middle of the diagram is a particle-particle diagram rather than a 
particle-hole diagram. However, time-reversal symmetry relates these two. 
Call the sum of particle-particle diagrams in figure 23.7c W(k, k', e n , e n+m ). 
By time-reversal symmetry, we can reverse the direction of one of the arrows 
if we also reverse the sign of the momentum. Because particles and holes 
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k k+q k+p k k k+q k+p k 



k' k'-q k 7 -p ^ -1c -kVq -kVp -k 

Figure 23.8: A particle-particle diagram is equal to a particle-hole diagram 
with particle momenta reversed to become hole momenta. 



both cost positive energy, but carry opposite momentum, a hole line running 
through a diagram can be replaced with a particle line at the same energies 
but opposite momenta without changing the value of the diagram. This is 
depicted in figure 23.8. From this observation, we see that: 

W(k, k', e n , e n+m ) = {riiV 2 ) 2 /(k + k', e„, e n+m ) 

+ ... 

(rnv 2 ) I(k + k', e ra , e n+m ) 
1 - (mv 2 )I(k + k', e n , e n+m ) 



If we substitute this into a conductivity bubble to compute its contribu- 
tion to the conductivity, as in figure 23.7b, we obtain 



r / x 1 vl [ d d k d d k' 1^,, 

s*M = —-j J ^fLP^ e «) ^ e « + ^) x 

W(k, k', e n , e n+m ) G(k', e n ) Q(k', e n + uj m ) (23.82) 

From (23.81), we see that IF(k, k', e n , e n + m ) is sharply peaked at k ~ — k'. 
Hence, we write q = k + k' and change the variables of integration to k, q. 
In the second pair of Green functions, we take k = — k', neglecting the weak 
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q dependence of those factors. We now have: 

x , x 1 AT v 2 F f f d d q l^n iV 2 ( 6{-e n e n+m ) 

Sa{uj m ) = N F — / di k / — — -> 2 

u) m d J J (2TT) d t \\uj m \ + Dq 2 



1 



47ri /" <2 d g 1 sr^n-iv 2 ( 9(e n+m ) - 



—-N / ^ > 



uj m d J (2tt)<* (3^ t V l^ml + ^g 2 

\ 3 

1 



iuj m + ^(6>(e n+m ) - 0(e n )) 



4n„ v| f <A l^n^ 2 ( 9{e n+m ) 



1 \ 



47T 3 /" <A l^ n^ 2 / (9(-e ra+m e n ) 



vr y (2vr) d |w m | + Dq 2 
27T d / 2 1 / 1 



vr r(f)(27r) d a!-2 V V D 



d-2 ' 
2 



(23.83) 



We have taken the ultraviolet cutoff to be l/£. For shorter wavelengths, 
one should use ballistic, rather than diffusive propagators, so this is the 
appropriate cutoff. 

While (23.82) is a small correction in the low frequency limit for d > 2, 
it is divergent in this limit for d < 2. In d = 2, it is; 

Sa(u m ) = --rL ln( 



47T 2 Vl^ml V 2 



do 7t&!,f£ \ |u; m | £ 2 

which is also divergent. 

Hence, in d < 2, the correction (23.82)-(23.84), which is formally smaller 
than the (semi classical) Boltzmann result by a power of kpt ~ D is, in 
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fact, divergent as to — ► 0. Thus, the starting point of the diffusive fixed 
point is unstable. In the next section, we will discuss the result of this 
instability in more detail but, for now, we observe that it suppresses the 
conductivity. Thus, it drives the system towards localization. In d = 2, 
when this correction is still weak (sufficiently high frequencies or, as we will 
describe below, temperatures) and growing only logarithmically, it is called 
weak localization. 

One might wonder whether is is valid to stop with the diagrams which we 
have calculated. Isn't it possible that if we computed some more diagrams, 
we would find other divergent corrections to the conductivity which might 
change our conclusions entirely? The answer is that this will not happen and 
we can stop here. The divergences which we find are due to the existence 
of slow modes in the system. The diffusive mode is a slow mode because it 
is related to the conservation of charge. The particle-particle mode (called 
a 'Cooperon') is slow because it is related to the diffusion mode by time- 
reversal symmetry. There is no other mode which is guaranteed to be slow by 
symmetry, so we do not need to worry about other entirely new divergences, 
although these same slow modes can lead to subleading divergences. We 
will discuss this point in the next chapter using an effective field theory for 
the slow modes of the system. 

A simple physical picture for weak-localization is given by the notion 
of coherent backscattering. The amplitude for an electron to go from r 
to r' is, a complicated function of the trajectory because of the possibility 
of repeated impurity scatterings along the way. Different trajectories from 
r to r' will therefore interfere with each other - sometimes constructively, 
sometimes destructively. However, a trajectory from r back to r itself will 
always interfere constructively with the time-reversed trajectory which is 
scattered by the same impurities, but in reverse order. Thus, there will be 
an enhanced backscattering amplitude from r back to r, compared to the 
amplitude to go to any other r'. This enhanced backscattering is the source 
of weak localization. Consider the probability for an electron at r to return 
to r within time ix,: 



where fx, = D/L 2 is the time by which the electron has diffused across the 
system. The correction to the conductivity is 




(23.85) 



P(t L ) ~ r 



l-d/2 _ / l-d/2 



(23.86) 



a 



Any effect which inhibits constructive interference, such as inelastic scat- 
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tering or a magnetic field will, therefore, spoil weak localization. Thus far, 
we have considered non-interacting electrons and neglected phonons, etc. If 
such sources of inelastic scattering are considered, they cut off the above 
divergences. At length scales longer than the Thouless length 

^Thouloss = \J Dt [ n (23.87) 

inelastic processes will thwart localization and cause ohmic conduction. For 
example, the 2D weak-localization correction is: 

Sa(u m ) = - JL ln(£ Thoulcss /^) (23.88) 

If r in ~ T~ p , then the frequency is replaced by ~ T p , not T, as one might 
naively assume. 

A weak magnetic field will also cut off localization at length scales longer 
than the magnetic length lu = ly/H. (Restoring the fundamental constants, 
it is £h = y/hc/eH.) At these length scales, the time-reversed trajectory 
differs in phase by ~ ir as a result of the magnetic flux which it encloses and 
no longer interferes constructively. Thus, in the weak localization regime, 
where 

Sa(u m ) = -^2 H^u/t) (23.89) 

the conductivity increases as the magnetic field is increased, i.e. the mag- 
netoresistance is negative. 

23.4.6 Weak Magnetic Fields and Spin-Orbit Interactions: 
the Unitary and Symplectic Ensembles 

23.4.7 Electron-Electron Interactions in the Diffusive Fermi 
Liquid 

How are the conclusions of the previous subsections altered by the influence 
of electron-electron interactions? How does the diffusive motion of electrons 
affect the relevance or irrelevance of interactions at the diffusive fixed point, 
as compared to their marginality at the clean Fermi liquid fixed point? As 
we will see in this subsection, diffusive motion is so slow that it enables 
electron-electron interactions to cause divergent corrections at the diffusive 
fixed point. 

The simplest example of such a divergence is in the single-electron Green 
function, from which we can obtain the single-electron density of states: 

N{E) = J Wf G{E ^ ] (23 - 90) 
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Figure 23.9: The renormalization of the density vertex by repeated impurity 
scatterings. The unarrowed solid line on the left is the interaction V. 

Consider the self-energy diagrams of figure 23.10. Let us assume that the 
electrons interact through some effective density-density interaction V(q). 
Then the main effect of disorder is to modify the nature of the interac- 
tion vertex. Physically, since the electrons move more slowly when the 
diffuse, they spend more time in close proximity and, therefore, interact 
more strongly. The basic density vertex is renormalized by repeated impu- 
rity scatterings according to (23.68). Hence, the interaction vertex V(q) is 
renormalized in a corresponding way, as depicted in figure 23.9: 

V(q) V{q)(T(q, e n , e n + ui n )f (23.91) 

where T(q, e n , e n +u> n ) is given in (23.68). There are two factors of T(q, e n , e n + 
u n ), one for each particle- hole pair which is coupled by V. 

Consider, then, the first diagram (exchange) in figure 23.10. Its leading 
contribution to the self-energy is: 



1 I d^q ( 1 1 \ 2 



1 



ie n + iuj m - ik+q + o T \ 
Clearly, the integral is dominated by u m ~ 0, q ~ so we write 



(23.92) 



v ( 1A IT [ ddq (I G{-e n e n+m ) ^ 1 

^ ] ~ ^-Hu + ^-J^J W\t \u m \ + Dq 2 

-2^ Sg n(n)I E /|^(l^^)(2,93) 



Consider the case d = 2. 

v / t \ -iV(0) , J r f 2 (Q{-£n£n+m) 
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Figure 23.10: The exchange (Fock) and Hartree diagrams contributing to 
the electron self-energy. Where particle and hole lines are connected by 
dotted lines, the vertices are dressed by repeated impurity scatterings so 
that they take a diffusive form. The unarrowed solid line on the left of the 
vertex is the interaction V. 



Let us suppose, without loss of generality, that e n < 0. Then the Matsubara 
frequency sum runs over to m > — e n . The ultraviolet cutoff is 1/r, since for 
higher frequencies the diffusive form is not correct. For low T, the sum can 
be approximated by an integral, and we find: 




(23.94) 



^ex(Cn) k) ~ —1 



V(0) 



sgn(n) ln(|e n |r) 



(23.95) 



(2tt) 2 Dt 



The correction to the density-of-states is 



SN ex (e) = 



d 2 k 



lm((G {k,e)) 2 X(e,k)) 



J (2vr)2 
V(0)N F 
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H\e\r) 



ln(|e|r) 



(23.96) 
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Thus, the correction is singular. 

The second diagram (Hartree) in figure 23.10 has opposite sign as a result 
of the closed fermion loop and an extra factor of two from the spin sum in 
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the loop. It has leading contribution: 



v , i \ „V f rf2 P d2 P' d2 1 mi ( I 6(-e n e n+m )\ 

SHa rtrC c(e n , k ) = - 2 -^jj^J^fJ^f nP-P ) [- {uJm \ +Dq 2 ) 



m 

1 1 



i^n ip + 2t„ i e n+m Cp+q + 2t„ 



+ m 

(23.97) 



As before, we set u) m = q = in the electron propagators; notice that the 
interaction is not at zero momentum, however. 

v / i \ 2^ f d 2 p d 2 p' d 2 q , fl 6(-e n e n+m ) \ 2 



m 

1 



1 1 

— — (23.98) 

The electron Green functions are all peaked around the Fermi surface, so 
we have 

ImSHartree(eri, k) PS 

(3^ J (2ttY\t \io m \ + Dq^J J 2vr V ; 

= ( 2 2 ^^ sgn(n)ln(|e ra |r) (23.99) 

where 

F = W) I ^ v( - 2pFCOS ^ ( 23 - 10 °) 

is an average of the interaction over all momenta connecting two points on 
the Fermi surface. 

Hence, adding the exchange and Hartree terms we have 

Since F < 1 (because V(q) decreases with increasing q), the exchange terms 
wins, and the net effect of interactions at lowest-order is to suppress the 
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conductivity. When this correction becomes large, \e\r <C 1, it is no longer 
valid to use lowest-order perturbation theory; but one can speculate that 
this suppression eventually leads to a vanishing of the density-of-states at 
the Fermi energy, analogous to the Coulomb gap discussed in the context of 
insulators. A similar correction is expected for the specific heat. 

The basic physics behind this correction is that the interaction between 
two electrons is renormalized by processes in which they diffuse back to each 
other and interact again: 

-l/e 

V{0) -► V(0) I 1 + / dtP(t) I (23.102) 



where P(t) ~ \/{Dt) d / 2 . In d = 2, this is: 

V(0) V(0)(l + cln(|e|T)) (23.103) 

The effects of interactions bear the signature of this correction. 

Thus, interactions lead to singular corrections to the density of states in 
d = 2 (and worse singularities in d = 1 (exercise)). This is in contrast to a 
non-interacting disordered system, in which the density-of-states is perfectly 
smooth. On the other hand, the conductivity receives singular corrections 
even in the non-interacting system. How do interactions affect this? Simi- 
lar, but more complicated diagrams than those of fig. 23.10 determine the 
lowest-order interaction correction to the conductivity of a diffusive metal. 
They give a contribution 

Sa(u) = V ^ F (l - F)\u(\u\t) (23.104) 
If the electrons interact through the screened Coulomb interaction, 

4we 2 

V(q) = A J~ (23.105) 
1 + 3p_n(g,w) 

and H(q, u) is the density-density correlation function. Hence, V(q — ► 0, uj = 
(n(g — > OjO)) -1 = iV^ 1 . This implies, for instance, that the conductivity 
correction is of the form 

<5<x scr .Coui.M = 2^2(1 - F)M\lu\t) (23.106) 

which is nearly identical in form and magnitude as the weak-localization 
correction. It is difficult to disentangle the two effects, but it can be done 
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by measuring the magnetoresistance since the weak-localization and interac- 
tion corrections are affected very differently by a magnetic field (the latter, 
through the Zeeman coupling; while the former is through the destruction 
of interference) . 

The diagrams which we have computed in this subsection are the lowest- 
order diagrams which have diffusively-dressed interaction vertices such that 
the particle- hole lines which are difffusing have low momentum/energy. One 
could imagine dressing the diagrams of figure 23.10 with impurity lines in 
other ways, but the diffusing particle- hole pair would not be near u = q = 0. 
Thus, while it may, at first glance, seem as if we have chosen an arbitrary 
set of diagrams, in fact, we have chosen precisely the ones which have the 
most singular diffusive corrections. If this seems a little unsatisfying, then in 
the next chapter we will introduce an effective field theory for the diffusive 
degrees of freedom, in which these corrections come from the only diagrams 
present at lowest order, so they can be obtained by simply turning the crank. 

23.5 The Metal-Insulator Transition 
23.5.1 Percolation 

Consider a disordered solid which is insulating as a result of impurities. 
Although it is insulating, there will be some isolated regions which are 'con- 
ducting'. In a system of non- interacting fermions, such regions would have 
single-particle states at the Fermi energy which extend across them but do 
not leak substantially into the surrounding insulating regions. One way to 
think about this is to consider the potential V(x), whose spatial variation 
is random as a result of impurities. The regions in which V(x) < €f are 
classically-allowed for electrons at the Fermi energy. Let us consider these to 
be the 'conducting' regions. The regions in which V(x) > €f are classically- 
forbidden for electrons at (or below) the Fermi energy. Let us call these 
'insulating' regions. 

Suppose we decrease the impurity concentration. Then the conducting 
regions will grow in size. As we continue to decrease the impurity concen- 
tration, the conducting regions will grow larger and larger and will begin to 
merge. Eventually, it will become possible to go from one end of the system 
to the other within one long conducting region which spans the system. If 
electrical conduction were purely classical and ohmic, this impurity concen- 
tration - which is called the percolation threshold - would be the transition 
point between metallic and insulating behavior. 
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However, electrical conduction is not classical. Even below the perco- 
lation threshold, electrons can tunnel quantum mechanically from one con- 
ducting region to another. Conversely, an electron can take two different 
paths from point A to point B, and the quantum mechanical amplitudes 
for these processes might interfere destructively. Consequently, a system 
might be insulating even above the percolation threshold. Nevertheless, at 
temperatures which are higher than the characteristic phase coherence tem- 
peratures of the system - so that coherent quantum tunneling cannot occur 
- but not so high that the conductivity of the 'insulating' regions (insulating 
and metallic are precise distinctions only at zero temperature; at finite tem- 
peratures, all systems have finite conductivity) is comparable to that of the 
conducting ones, we expect the conductivity and other physical properties 
to show behavior characteristic of percolation processes. Since percolation 
is, in a sense, the classical limit of the metal-insulator transition, we will 
discuss it briefly. 

There are two basic lattice models of percolation, site percolation and 
bond percolation. In site percolation, one randomly paints black some of the 
sites of a lattice and considers the properties of the remaining sites, which 
are colored red. We imagine that an electron can only be on a red site, never 
on a black one. It can hop directly from a red site to its nearest neighbors 
if they are also red, but not to other sites. Thus, if there is an isolated 
red site which is surrounded by black sites, then an electron on that site is 
stuck there. Sites which are red can be grouped into clusters of sites which 
are continguous with each other. One site in a cluster can be reached from 
any other site in the cluster by a sequence of nearest-neighbor hops entirely 
within the cluster. In bond percolation, on the other hand, one randomly 
removes bonds from a lattice. An electron can hop from one lattice site to 
a nearest-neighbor site only if the bond between them is present. Sites can 
again be grouped into clusters which are connected by bonds so that any 
site in a cluster can be reached from any other site in the cluster. 

For the sake of concreteness, let us consider bond percolation. When 
the average bond density, which we will call p, is low, clusters tend to be 
small. As the bond density is increased, clusters grow until the percolation 
threshold, p c , is reached. At this point, one cluster extends across the length 
of the system; it is called the percolating cluster. For p < p c , the average 
distance between two sites on the same cluster, £, scales as 

£~{Pc-p)~ u (23.107) 



For p > p c , the fraction, P, of the system which belongs to the percolating 
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cluster is 



Pip) ~ {P-Pcf 



(23.108) 



(For p < p c , P = because the largest cluster is finite even in a system of in- 
finite size.) Notice the analogy between these exponents and the correlation 
length and order-parameter exponents associated with critical phenomena. 

The precise value of p c depends on the particular lattice and whether it 
is bond or site percolation. However, the exponents v, (3, etc. are universal, 
and depend only on the dimension. 

One might imagine that the conductivity (assuming that bonds which 
are present are considered to be conducting and those which are absent are 
insulating) would be proportional to the fractions of the system belonging to 
the largest cluster. However, this is not the case because many of the bonds 
in the cluster lead to dead-ends which do not contribute to the conductivity. 
Instead, the conductivity is associated with a different exponent, //, (some- 
times called t) which describes the 'backbone' of the percolating cluster - i.e. 
the cluster with dead-ends removed (dead-ends are those bonds which do 
not have two completely independent routes to the two ends of the cluster). 



At the percolation point itself, the system will have power-law correla- 
tions. Many of the properties of the percolation point in two dimensions 
have been revealed using conformal field theory and, more recently with 
the use of the stochastic Loewner equation. For instance, v = 4/3 in two 
dimensions. 

However, as we pointed out earlier, percolation does not correctly de- 
scribe metal-insulator transitions because it misses quantum-mechanical ef- 
fects. In the remainder of this section, we discuss alternate approaches which 
take these effects into account. 

23.5.2 Mobility Edge, Minimum Metallic Conductivity 

Let us now consider a metal-insulator transition in a system of non-interacting 
fermions. Such a transition can occur as a result of varying either the chem- 
ical potential or the disorder strength. If the system is weakly-disordered, 
then the energy spectrum will still approximately break up into bands, al- 
though there may not be a true gap between the bands because there will be 
exponentially-small tails in the density of states. Near the bottom of a band, 
the states will all be localized because the kinetic energies of these states 
will be low. Near the top of the band, the same will be true. It is useful 



a(p) ~ {p-p c ) 



(23.109) 
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Figure 23.11: The density-of-states in a disordered system, with extended 
and localized states and mobility edges shown. 

to think of a band with chemical potential near the top as a nearly empty 
hole band; in such a case, the hole kinetic energies near the top of the band 
are low. Near the middle of the band, on the other hand, the states will 
be extended if the disorder strength is sufficiently small because the kinetic 
energies will be large in comparison. (If the disorder strength is too large, 
then even at the middle of the band the kinetic energy will not be large 
enough to overcome it.) What happens as the Fermi energy is swept from 
the bottom of the band to the middle? To answer this question, one should 
first note that there are no energies at which there are both extended and 
localized states because localization is not stable in the presence of extended 
states at the same energy. A small change in the particular realization of 
disorder (e.g. the locations of the impurities) would cause mixing between 
the localized states and the extended ones. When a localized state mixes 
with extended states, it becomes extended. 

Hence, there will be sharp energies which separate localized states from 
extended ones. Such an energy is called a mobility edge. As depicted in 
figure 23.11, the mobility edge separates the strongly- localized states with 
low kinetic energy in the band tails from the extended states with high 
kinetic energy in the center of the band. (One can imagine there being more 
than a single pair of such energies, but this would not be generic and would 
require some special circumstances.) 

As the chemical potential is moved through the mobility edge, E c , the 
system undergoes a metal-insulator transition. For Ep < E c , the zero- 
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temperature DC conductivity a vanishes. For Ep > E c , it is finite. What 
happens at Ep = E c ? Is it equal to some minimum non-zero value (Jmin? 
the minimum metallic conductivity? According to the percolation model 
described in the preceding section, the conductivity vanishes precisely at 
the transition, a ~ \Ep — E c \^. (Ep — E c or any other control parameter 
will act as a proxy for p — p c near the transition.) On the other hand, the 
following heuristic argument suggests that it is finite. The conductivity is 
given in Boltzmann transport theory by 

e 2 

a = A d — k d f 1 e (23.110) 

where I is the mean-free-path and A d is a dimension-dependent constant. 
(Here, we have restored the factors of e, h which were set to one earlier.) If 
scattering is due entirely to impurities, the mean-free path cannot be shorter 
than the inter-atomic spacing a in the solid. Since hp ~ ir/a and £ > a, 
kpt~-K. Hence, we conclude that 

e 2 

c- mi n = A d jk d F - 2 (23.111) 

In particular, in d = 2, the minimum metallic conductivity can be written 
entirely in terms of fundamental constants, cr m j n = A d e 2 /h. 

However, one should be cautious in applying the Boltzmann transport 
formula (23.110) - which is semiclassical and valid for large kpl - in the 
regime kpi ~ 1. Indeed, as we will see in the next subsection, the preceding 
logic fails in the case of non-interacting electrons. 

23.5.3 Scaling Theory for Non-Interacting Electrons 

Thouless had the important insight that one should study the scaling be- 
havior of the conductance. Consider a system of non-interacting electrons 
of size L. Thouless wrote the conductance of the system in the following 
form, which we encountered earlier (23.6): 

SE , 
g(L) = == (23.112) 

5E is the disorder-avergaged energy change of eigenstates at Ep when the 
boundary conditions are changed from periodic to antiperiodic. AE is the 
mean level spacing of eigenstates. (In order to derive such a relation ana- 
lytically from the Kubo formula, one must consider disorder-averaged quan- 
tities; otherwise, one could probably consider such a relation for a fixed 
realization of disorder.) 
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To understand this formula, suppose that D(L) is the diffusion constant 
for an electron at the Fermi energy. The time required for an electron to 
diffuse across the system is the Thouless time: 

t L = ^ (23.113) 

According to the uncertainty principle, 

5E ^ — ~ (23.114) 

Meanwhile, 



±E=— d (23.115) 

Hence, 

== = N F D(L)L d - 2 = aL d - 2 = g(L) (23.116) 

The second equality follows from the Einstein relation between the diffusion 
constant and the conductivity. 

We now study how g(L) scales as the system size is increased. We could 
increase the system size from L to bL by stacking b subsystems of size L. The 
key assumption of Abrahams, Anderson, Licciardello, and Ramakrishnan 
('the Gang of Four') is that: 

g(bL) = f(b,g(L)) (23.117) 

In particular, L does not appear explicitly on the right-hand-side. In a 
macroscopic Ohmic system, such a relation is obvious, g{bL) = b d ~ 2 g(L). 
However, the Ohmic scaling form is not exactly right microscopically. While 
AE(bL ) = b~ d AE(L) should hold generally, we need to find a relation for 
5E(bL). This relation might be complicated since we have to match the 
eigenfunctions from each size L block at their boudaries in order to construct 
the eigenfunctions of the size bL system. However, the sensitivity of the 
eigenfunctions to their boundary conditions is precisely what is encapsulated 
in g(L). Thus, it is very reasonable to assume that 5E(bL) depends only on b 
and g(L), and, therefore, that g(bL) does. As noted above, this assumption 
holds in the ohmic regime. It also holds in the localized regime, where 
g(L) = g§e~ L l^\ g(bL) = go (g(L) / go) b . The scaling assumption of the 
gang of four was that the form (23.117) holds throughout the entire region 
between these limits. 
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If (23.117) holds, then we can define 
ding 1 / df 



dlnL g \db J b=1 



0(g) (23.118) 



In strongly-disordered systems, g is small, and all states are localized. In 
this limit, we expect (3(g) = ln(g/go). In the clean, or large g, limit, we 
expect Ohmic scaling, and /3(g) = d — 2. 

How do we connect the two limits? For g large, we can compute (3(g) 
in powers oil/g or, equivalently, l/kp£. In the previous section, we have 
already computed the first correction in 1/fcp^ to the conductivity. Suppose 
we reconsider this calculation in a system of finite-size L. Then 1/L is 
an infrared momentum cutoff, and we can take the other cutoffs used in 
the previous section (frequency 1 / y^uj/D, Thouless length ^DTi n , magnetic 
length,...) to infinity. We thus obtain a(L) in d = 2 for large hpi. For 
d = 2 + e, we can compute in powers of e. Since g = a in d = 2, 

9(L) =*( 1 -2i ln (i)) (23 ' U9> 

For d = 2 + e, corrections to this formula for a are 0(e), which we drop. 
Differentiating, we obtain (3(g). 

m=e ~2^~g + --- (23 ' 120) 

where . . . represents O (l/ff 2 ) terms and higher-order in e terms have been 
dropped. This is interpolated with the small g limit in figure 23.12. 

There are several consequences which follow immediately. (1) There is no 
metallic state of non-interacting fermions in d < 2 (and, therefore, no metal- 
insulator transition, of course). (2) There is a metal- insulator transition with 
no minimum metallic conductivity in d > 2. (3) For d = 2 + e, the metal- 
insulator transition occurs at g ~ 1/e, which is in the perturbative regime. 
The first statement is clear from figure 23.12: since (3(g) < for all g, the 
conductance always flows to the localized regime of small g. The second 
statement follows from the positivity of (3(g) for large g. In order to connect 
to the negative (3(g) localized regime at small g, (3(g) must pass through zero 
at some g c . For g > g c , (3(g) > and the conductance flows to large values, 
where it is Ohmic; this is the metallic state. For g < g c , (3(g) < and the 
conductance flows to small values; this is the insulating state. For g = g c , 
the conductance remains constant; this is the transition point. However, if 
the conductance remains constant as the system size L goes to infinity, then 
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Figure 23.12: The /3-function of Abrahams, et al. which is obtained by 
smoothly connecting the Ohmic limit with the localized one. 

the conductivity a c = g c L 2 ~ d — > 0. Hence, there is no minimum metallic 
conductivity. 

The third statement follows immediately from (23.120). We can say 
more about the critical region by expanding the (3- function about g c : 

m ~ \ (^) (23 ' 121) 

From our /3-function (23.120), we see that \/v = e and g c = l/47r 2 e, where 
e = d — 2. Since the ... in (23.120) can be neglected only for g large, we 
can trust our g c = l/47r 2 e conclusion only for e small, i.e. only near two 
dimensions. 

Suppose \g — g c \ is small at the microscopic scale i. The linearized 
approximation to the (5- function is applicable. Let us integrate until \g — 
9c\/Qc ~ 1- We define the length scale at which this is reached as £: 

(23.122) 

This is an important length scale because the system will cross over from 
critical behavior at short-distances to Ohmic metallic (g > g c ) or localized 
insulating (g < g c ) behavior at longer length scales. On the insulating side, 
£ is the localization length. 

On the conducting side as well, £ is a quantum-classical crossover length. 
For length scales longer than £, electrical conduction is Ohmic and, therefore, 
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essentially classical. For length scales shorter than £, quantum interference 
effects can occur and classical ideas are inappropriate. Since the conductivity 
is independent of length in the classical Ohmic regime, it is equal to 

(23.123) 

The vanishing of the minimum metallic conductivity can be understood as 
the divergence of £ at the metal-insulator transition. 

Thus, the scaling hypothesis of Abrahams, et. al is equivalent to the 
assumption that there is a single length scale, £, which becomes large near 
the metal-insulator transition. This is completely analogous to the case of 
classical thermal phase transitions studied in chapter .. or quantum phase 
transitions studied in chapter .. However, there is an important difference: 
a metal-insulator transition of non-interacting fermions is not a thermo- 
dynamic phase transition. While the conductivity changes sharply at the 
transition, all thermodynamic properties, such as the ground state energy, 
are perfectly smooth across the transition. The latter follows from the 
smoothness of the density-of-states across the transition (which, in turn, 
was proved by Thouless) . A simple way of seeing why it is smooth is to con- 
sider the disorder-averaged single-particle Green function. It is short-ranged 
on both sides of the transition and does not show any qualitative difference 
between the metallic and insulating states, so it (and the density-of-states 
which is derived from it) varies smoothly across the transition. For instance, 
weak- localization effects, which suppress the conductivity, do not affect the 
single-particle Green function, as one may see by drawing diagrams which 
might renormalize the single-particle Green function. Within the effective 
field theory approach of the next chapter, this will be a little more obvious. 

In an interacting system, however, a metal-insulator transition might be 
a true thermodynamic phase transition. Because interacting disordered sys- 
tems are technically challenging, we will postpone further discussion of this 
issue to the next chapter, after we have developed the appropriate effective 
field theories. 

23.6 The Integer Quantum Hall Plateau Transi- 
tion 



a = 



-A~2 
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Non-Linear cr-Models for Diffusing Electrons and Anderson 

Localization 



24.1 Introduction 

In this chapter, we will present a formalism which will elucidate the results 
of the previous chapter and cast them in the form of a field theory which 
brings out analogies between the metal-insulator transition and other critical 
phenomena. The basic physical observation is that the only low-energy, 
long-wavelength correlation functions (after disorder-averaging) are density- 
density, spin-spin, etc. correlation functions which are diffusive in form 
as a result of a conservation law. All other disorder-averaged correlation 
functions decay exponentially. Hence, we construct an effective field theory 
for the low-energy, long-wavelength correlation functions at the diffusive 
Fermi gas fixed point of figure 24.1. (However, it is worth remembering that 
there is still a constant, non-zero density of states, which reflects the fact 
that there is still a Fermi surface's worth of single-particle states near the 
Fermi energy. The 'mass term' in the Green function, G~ l = e — ^ + i/2r is 
imaginary. It's not that there are no low-energy states, as would be the case 
with a real mass term. Rather, it is that there are low-energy states, but 
they are not momentum eigenstates. These states show up in, for instance, 
the specific heat. Upon disorder-averaging, however, only the diffusion of 
conserved currents remains at long- wavelengths. Thus, we would like an 
effective field theory which directly gives us this diffusion physics without 
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Figure 24.1: Renormalization group flow diagram for a Fermi gas in the 
presence of disorder. The clean fixed point is highly unstable to disorder 
and flows into the diffusive metal. When the disorder strength becomes 
sufficiently strong, a metal-insulator transition occurs, beyond which the 
flows are into the Anderson insulating fixed point. 



the encumbrance of the low-energy single-particle states.) 

One might be tempted, at first, to construct an effective theory built on 
the density operator, since it has long-ranged correlations. Let's imagine 
trying to do this. The first problem which one must confront is that the 
density satisfies the diffusion equation, so we would expect it to have an 
action like 

S[p]= f d d xdtp(d t + V 2 ) p (24.1) 



However, the time-derivative term is a total derivative, so we can drop it 
from the action! 

Consider, also, the problem of interaction terms. The density should 
have self-interactions - perhaps of the form p 4 - since these must gener- 
ate the quantum interference effects which we computed in the previous 
chapter. Also, there should be some coupling of the density to the electron 
operator, presumably of the form p^jpij). However, there are problems with 
such terms: for instance, the latter would induce singular corrections to 
the electron Green function, which shouldn't be present in the absence of 
electron-electron interactions. 

Thus, we see that a simple-minded approach to an effective field theory 
won't work, but, as we will see later, the general philosophy is correct. In 
order to endow our theory with the necessary properties, we will need some 
more formalism, to which we turn in the next section. 
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24.2 The Replica Method 

Let us return now to the problem of performing the disorder average of the 
free energy. 

F = J W F[V] e™ J dx v 2 (x)/2mv 2 
= J VV In Z[V] e~ I ^xV 2 (x)/2 ni v 2 

While Z[V] has a functional integral representation, In Z[V] does not. Hence, 
we cannot write, say, an effective action for the disorder-averaged system. 
However, we now observe that 

In Z[V] = lim - (Z n - 1) (24.3) 

n^O n 

This is a useful expression because Z n has a simple functional integral rep- 
resentation if n is an integer: we just imagine that we have n independent 
copies of the system. Thus, apart from a constant 1/n term, the free energy 
is 

~F = lim — / VV nz^Ztyte-E^SWa^V] e ~ f d<*xV 2 (x)/2 ni v 2 
n— >0 n J 

a=l 

= J pV^Vti VVe-X^iWaAM e -fd«xV 2 (x)/2 ni v 2 y 24A) 

The replica index a = 1,2, ... ,n labels the n identical copies of the system. 
Hence, we have a disorder-averaged action: 

e -5 avg [Va,Vi] = J 2)7 e -E:=i%^.Vl e - / d d xV 2 (x)/2 ni v 2 ^ 24 _ 5 ^ 

After V has been integrated out, the n independent ip a s become coupled 
together. 

This method for averaging the free energy is called the replica method, 
and the limit n — > is called the replica method. There is some sleight 
of hand involved here since we can do the calculation for integer n but we 
then take the limit n — > 0. However, within perturbation theory, there is 
no difficulty in taking this limit; it is simply a bookkeeping device which 
discards unwanted diagrams with closed electron loops. Beyond perturba- 
tion theory, little is known about the validity of taking the replica limit; one 
usually proceeds without answering this question and hopes for the best. 
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Consider, for example, a system of non-interacting electrons moving in 
a random potential: 

Sty, ^t] = J dr d d x ^ (d T + ^- V 2 + iijij> 

+ J dTd d xV{x)ipl(x, T )ip a {x,T) (24.6) 

Following the replica procedure, we find that 

i r n 

F = - / JJP^ a P^t e -Sav g [V ,4] (24.7) 

where 

+ n iV 2 J dTdT'd d xipl(x,T)Mx,T)^l(x,T')Mx,T') (24.8) 

The second term, which results from the disorder average, is non-local in 
time (reflecting the time-independence of the disorder) and couples different 
replicas. This action has both figures 23.4a,b but with a free summation 
over a replica index in 23.4b so that this diagram is proportional to n. In 
the replica limit, this diagram vanishes. 

The RG perspective on the diffusive Fermi liquid which we introduced 
in the previous chapter applies directly to this replicated action. 

Mention Supersymmetry, Schwinger-Keldysh 



24.3 Non-Interacting Electrons 
24.3.1 Derivation of the cr-model 

We will focus on interacting electrons or, at least, on methods which can 
be applied to both non-interacting and interacting electrons. There are tow 
reasons for this: (1) there is a great deal of formalism which has evolved to 
describe non-interacting electrons, which is beyond the scope of this book 
and is well-described in books and review articles, and (2) in the real world, 
electrons interact, and these interactions are evidently important in two- 
dimensions, as we saw when we studied perturbative corrections in the pre- 
vious chapter. Hence, we focus on interacting electrons in the presence of 
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disorder. However, the problem of non-interacting electrons in under much 
better control, and the theory is on much safer footing. Some of the analo- 
gies with other critical phenomena can be made more precise. Hence, we 
introduce this theory briefly and comment on its physical interpretation, 
before moving on to the interacting case. 

In order to compute the conductivity or density-density correlation func- 
tion, we need to compute the product of an advanced and a retarded electron 
Green function at the chemical potential. Such products of an advanced and 
a retarded Green function have their phase cancelled, so that impurity aver- 
aging does not make them short-ranged, unlike the electron Green function. 
For instance, the advanced Green function is the complex conjugate of a 
retarded Green function at the same energy, so 

, \ 2 

tf;'(E + id, x)if)(E + id, x 



^{E + id, x)ij}{E + id, xj) U\E - iS, x)ij}{E - id, x)) (24.9) 



These Green functions can be obtained from the following functional inte- 
gral: 



/ 



^{E + id, x)ip{E + id, x)J (^{E - id, x)ip{E - id, x)J = 

2^ R (x) PVi(x) Vij, A {y) (^(x)^(x) ^(x)Va(x)) 

(24.10) 



e -Se,e 



where 



Se,e> = J d d x (e + id + ^V 2 ) Vij + V(x)^ R (x)i, R {x) 

+ tf A (e' -id + ^V 2 ) ^ A + V(x)^ A (xW A (x)j (24.11) 

In order to perform the disorder average, we use the replica method. We 
introduce n copies of the functional integral 

„ n 

Z n [V] = J n^(x)^(x)^aA(x)^i A (x)e- 5 -,-' (24.12) 
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where 



SnE,E> = J d d X (Vl* (E + iS + ^V 2 ) ^ aR + V(x)^ aR ( X )^ aR (x) 

+ (e' -iS + ^V 2 ) VaA + WoaO*)) (24.13) 

We now average over disorder: 

~Z~n = J ^y e ~ S d 1 d xV 2 {x)/2n i v 2 2 n \y\ 
n 

n^aflW ^l R (x) PVaA(x) ^L(x) e"^"' (24.14) 



/ 



a=l 



with 



£ + + — V 2 



[V»lfl' ^aR> ^aA^aA] = j d d X 

+ ^(^-^ + ^v 2 )^ aA 

^ (V'lfiV'aR + ^LV-oa) (^6* + ^IaAa) } (24.15) 



+ n; 



We introduce a capital Latin index B = R, A, so that this can be re-written 

S E+oj/2,E-u/2^aB^aB\ = 

J d d x \^\ B (d BC [e + ±- V 2 ) + Abc (| + «) ) ^aC 

+ n^ 2 Vj B V'bs ^cV'cc] (24.16) 

where Abc = diag(l, —1). 

If we were to compute perturbatively with this effective action, we would 
recover the results of the previous chapter. However, as we noted earlier, 
the final term, is a strongly-relevant perturbation. It is no-local in time or, 
equivalently, it is a function of only two, not three energies. Thus, we should 
not expect perturbation theory to be valid for this term. Indeed, we did not 
us low-order perturbation theory in the previous chapter, but had to sum 
an infinite set of diagrams in order to obtain physically-reasonable results. 
Here, we will deal with this relevant perturbation by decoupling it with a 
Hubbard-Stratonovich transformation. 
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There are two choices of Hubbard-Stratonovich decomposition. (We will 
make the simplification of ignoring spin in this section.) We can either in- 
troduce a Hubbard-Stratonovich field which is essentially equal to ^l^bB 
or one which is equal to V^bVcC- ^^ ie f° rmer i s J us t the density of electrons 
at energy E + oj/2 plus the density of electrons at energy E — u/2. If it 
develops an expectation value, it is just a shift of the chemical potential. 
The latter, however, is non-local in time (since it is a function of two en- 
ergies, E ± u/2) and can, in principle, have non-trivial structure in replica 
and advanced/retarded indices. Thus, we decouple the quartic term in this 
channel. In so doing, we re- write the final term in (24.16) as: 



ii B ^bB ^Ic^cC = -^bB^cC ^cC^ 



bB 



(24.17) 



We introduce a Hubbard-Statonovich field Qbc,BC by rewriting the par- 
tition function as: 



\{ViP aR (K)V^ aR (K)VQ bc , BC 

a=l 



(24.18) 



where 



Sm[Qbc;BC^i B ^aB\ = 

J d d x [vd B (s BC (e + ^V 2 ) + A BC (f + <*)) 



1 " -t ./, i wN F n n 

Hbc;BC<^cb;CB 



+ 7^Qbc;BC % B ^cC + |r 



(24.19) 



Note that the minus sign in equation (24.17) resulted in the i in this equation. 

To proceed, we will integrate out the electrons and look for a saddle-point 
for Q. Integrating out the electrons, we find: 



-/ 



VQe 



-Seff[Q,: 



(24.20) 



where S c s[Q,] is given by 

S eS [Q] = 



I 



d d x 



-tr 1.[6bc [E + -V 2 J + Abc (f + is) + ^Qbc 

ttN f 



+ 



4r 



-tr (Q 2 ) 



(24.21) 
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If we assume a translationally-invariant saddle-point, the saddle-point equa- 
tion is: 

d 2 p 1 



(27T) 



(24.22) 



It has solution 



Q bc; i?c = Abc 4c (24.23) 
At this saddle-point, the electron Green functions are of the expected form: 

1 



G B=RA (k,E) 



1 

P k 2 I 

^ 2m 31 2r 



(24.24) 



Thus, the saddle-point has self-consistently computed corrections to the 
propagator and summed them to obtain the self-energy correction. 

We can now obtain the spectrum of fluctuations about the saddle-point 
by expanding (24.21) about Q = A: 



S cS [Q] = 

S ddx 



-tr ln(^ (e + i-V^) + A BC (| + ± j + -*Q BC 
+ ^tr W A) + ^tr(«5Q 2 )' 



2r 



4r 



-/A [-*!„(< 



tr In I Gsc" 1 + +— 5Q BC 



2t 



+ ^tr(5QA) + ^tr(^ 2 ) 



2r 



4r 



(24.25) 



where G RR = G R (E + f), G A A = GU (£7- f ), and Gj^ 
Expanding the logarithm to second order, we have: 



S cS [Q] = J 



d d x 



-■^tr (GSQGSQ) + ^tr (5Q 2 ) 



8r 2 



4r 



Gar = 0. 



(24.26) 



The linear terms in 5Q vanish by the saddle-point condition. Consider the 
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first term in equation (24.26): 

tr(G5QG5Q) = J ^ ^ (G R ( P + q)5Q RA (q)G A ( P )5Q AR (-q) 

+ G A (p + q)SQAR{q)G R (p)6Q RA {-q) 
+ G R (p + q)5Q RR (q)G R (p)5Q RR (-q) 

+ G A (p + q)SQ AA (q)G A (p)5Q AA (-q)) (24.27) 

In the previous chapter, we computed the p integrals in imaginary time. 
Continuing to real-time, we have 

/ Wf GR{P + q > E + ^ G *(P> E - f ) = 2 * N ft (1 - iur - Drq 2 ) 
,! " P ■ G R (p + q,E+ %)G R {p, E+^-)= (24.28) 



/ 



(2vr) d n ^ y ' 2 ' 2 



and similarly for the other two integrals. Thus, we have to second order in 
SQ: 

ScslQ] = ^ f d d x (Dti(V5Q RA f + Dtv(VSQ AR ) 2 

+ iutv(8Q 2 RA + 5Q 2 AR ) +i tr {6Q 2 RR + 6Q 2 AA )^ (24.29) 

To this order, Q RR and Q AA are massive fields, while Q RA , Q AR are massless 
for uj = 0. Since Q RR , Q AA are non-zero at the saddle-point, one might say 
that the longitudinal fields are massive, as in a non-linear u-model, while 
the transverse fields, Q RA , Q AR are massless. As we will see momentarily, 
this interpretation is correct. 

Suppose we would like to compute S e ff[<3] t° higher orders. We could 
continue to expand in powers of 5Q, but this is tedious. A much sim- 
pler way is to identify the symmetries of the action; these will constrain 
the form of <S c ff[Q] completely. Inspecting (24.16) for u, we see that any 
transformation of the form ipi,B — > U^BC^cC wu l leave (24.16) invari- 
ant so long as ul b . CB Ubd;BD = &cd$CD- Under such a transformation, 

Qbc;BC ~^ ul d . BD Qde;DEU eC ;EC- Since such a transformation must leave 
the action invariant, (24.23) is not the only saddle-point of the action. In 
fact, the entire manifold ul d . BD K.DEUd C ;EC extremizes the action. Thus, the 
effective action S e s [Q] must vanish for any Q of this form. Configurations 
in which Q varies slowly within this manifold must be gapless, since they 
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have only gradient energy. Small but non-zero to + id gives these excitations 
a small gap because it favors Q = A. Meanwhile, configurations in which 
Q moves out of this manifold should cost energy; this is why we found that 
dQun, <5Qaa are massive while SQra, 5Qar are massless. 

The above paragraph is almost correct. One gap in these arguments 
is that the unitary transformations Ubd,BD are not the only symmetries of 
the action (24.16). As a result of time-reversal symmetry, the equations 
satisfied by ip are the same as those satisfied by ift . In other words, we 
should not only allow transformations which mix the different ipbB^, but also 
those which mix ipbB with i^ cC so long as they leave iP\qiPcC invariant (and, 
therefore, leave (24.16) invariant for to + id = 0). These transformations can 
be parametrized by defining 

( XbB1 \ = ( ^ bB \ 

V XbB2 J V 4b J 

Then we can write 

i'bB^bB = 2 XbBiJijXbBj 

where Jij = e-ij. Hence, all transformations 

XbBi —> M bc -BC;ijXcCj 

are symmetries of (24.16) for uj + id = so long as 

M bc;BC;ij Jjk M cd;C D;kl = Jildbd^BD (24.33) 

or, simply, 

M T JM = J (24.34) 

where J = JudbddBD- Equation (24.34) is the defining equation of the 
symplectic group Sp(2n). 

Sp(2n) does not act in a simple way on Qbc;BC as we defined it above. 
However, we can rewrite 

^bB^bB ^Ic^cC = \ XbBiJijXbBj XcCkJklXcCl = ~ ^ XbBiJijXcClXcCkJklXbBj 

(24.35) 

and decouple this term with Q so that it becomes 

7^Qbc;BcXbBiJijXcCl + -^rQbc;BcQcb;CB = ^X JQX+ —^{Q ) 

(24.36) 



(24.30) 
(24.31) 
(24.32) 
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With this definition, Q must transform so that these terms remain invariant. 
In particular, the first term in (24.36) transforms as 

X T JQX - X T M T JQ'M X = x T JM- 1 Q'M X (24.37) 

In the first equality, we have used (24.34). This will remain invariant if 

Q' = MQM- 1 (24.38) 

which also leaves the second term in (24.36) invariant. 
Hence, the saddle-point manifold is given by 

Q = MKM 1 (24.39) 

This manifold is parametrized by Sp(2n) matrices M. However, not every 
different M leads to a different Q. Any M which doesn't mix retarded fields 
with advanced ones commutes with A. Therefore the corresponding subset 
of transformations Sp(n) x Sp(n) C Sp(2n) don't lead to new configurations. 
In other words, Q € Sp(2n) / Sp(n) x Sp(n). For such Q, the action is simply: 

Scff [Q] = ^ J d d x (Dtr(VQ) 2 + (ico - 5) tr(AQ) + . . .) (24.40) 

where the . . . refers to higher-derivative terms. The second term has been 
written in the above manner because non-zero uj + iS favors Q = A, which 
is equivalent to the condition Qra = Qar = which we had in the second- 
order expansion of S c g [Q] . 

The effective action S c s[Q] describes a non-linear cr-model. The main 
complication is that instead of a field which takes values on the sphere 
S 2 = 0(3)/ 0(2) (as in an antiferromagnet), we have a field which takes 
values in a different coset, namely Sp(2n) / Sp(n) x Sp(n). 

If we consider a system of electrons in a weak magnetic field, then trans- 
formations which mix ip and ^ are no longer symmetries of the action. The 
symmetries of the action 

/ d d x [vl B (sbc (e + ^(V - zA) 2 ) + A BC (| + tS^j i> aC 

+ mv 2 4bAb Vi^cc] ( 24 - 41 ) 

are U(2n) symmetries ip cC — > U c d-cD^PdD Decause th e first derivative term 
A • V + V • A is not invariant under the the rest of the Sp(2n) group. Hence, 
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a system of electrons in a weak magnetic field ('weak' means too weak for 
the quantum Hall effect to occur) will also be described by a NLcM of the 
form (24.40), but with Q taking values in U(2n)/U(n) x U(n). Systems in 
this universality class - characterized by the absence of time-reversal sym- 
metry - are said to be in the 'unitary ensemble' because of the unitary 
symmetry group U(2n). Meanwhile, systems with time-reversal invariance, 
as we discussed above, have symplectic symmetry group Sp(2n) and said to 
be in the 'orthogonal ensemble'. Finally, systems with spin-orbit scattering 
have symmetry group 0(2n) (see ref.) and are said to be in the 'symplectic 
ensemble'. The reason for this confusing terminology is that the method 
which we have used in this section is called 'fermionic replicas' because we 
could have used bosonic fields to construct the functional integral (24.10). 
For non-interacting electrons, thee two representations are equivalent. How- 
ever, the symmetries of the bosonic action are different, and we would have 
found NLcrMs for the cosets 0(n,n)/0(n) x O(n), U(n,n)/U(n) x U(n), 
Sp(n,n)/ Sp(n) x Sp{n) for the orthogonal, unitary, and symplectic cases, 
respectively. However the /3-function for 0(n,n)/0(n) x 0(n) is the same 
as the /^-function for Sp{2n) / Sp{n) x Sp(n), etc. so the same physics is 
obtained. 

24.3.2 Interpretation of the cr-model; Analogies with Classi- 
cal Critical Phenomena 

Superficially, our NLuM 

S c{i [Q} = ^ J d d x (Dtr(VQ) 2 + (ico - 5) tr(AQ)) (24.42) 

looks very similar to the NLuM which describes an classical ferromagnet or 
antiferromagnet near its lower critical dimension, d =2. 

S cS [n] = ^Jd d x (^tr(Vn) 2 + H • n) (24.43) 

In both cases, there is a Goldstone phase. In the former case, this is the 
metallic phase, with diffusion propagators (—iuj + Dq 2 )^ 1 ; in the latter case, 
this is the magnetically-ordered phase, with spin-wave propagators (H + 
p s q 2 )~ l . In both cases, there is a critical point in d = 2 + e which is near 
the Goldstone phase's stable fixed point. In the former case, it is the metal- 
insulator transition; in the latter case, it is the Curie or Neel point. However, 
the similarity ends here. In the latter case (24.43), there is a disordered phase 
in which the symmetry is restored and the order parameter vanishes. While 
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the same is true for any finite n in (24.42), this does not quite happen in 
the replica limit, n — > 0. 

The basic symmetry (replicated and enmeshed with particle-hole sym- 
metry) is between advanced and retarded Green functions. If Q were to 
vanish and we were to take iS = 0, these Green functions would be precisely 
the same. 

G B=R , A (K E) = * (24.44) 

h ~ 2^ + *fQbB 

However, if they were precisely equal, the spectral function and, therefore, 
the density-of-states would vanish. Hence, the density-of-states plays the 
role of an order parameter. Diffusion modes are slowly- varying oscillations 
of this order parameter which remain within the low-energy manifold Q = 
MAM' 1 . 

However, the density-of-states is non-zero in the insulating state as well 
and varies smoothly through the transition, in contrast to the magnetization 
at a magnetic transition. As we will show in the next subsection, the critical 
exponent for the order parameter vanishes, (3 = 0, so 

p{E) ~ - tr(AQ) 5 ^ + ~\E- E c f ~ const. (24.45) 

Thus, the symmetry is always broken. 

McKane and Stone, who pointed this out, explained this as a special 
feature of disordered systems. If E is in the regime of localized states, then 

\G(x, y;E + i5)\ 2 - ^5(E - E t ) |^(x)| 2 \Mv)\ 2 (24.46) 

for 5^0. Where the E\ are the energies of localized states. This follows 
from 

G (,, s ; E + «)=y;«M (24.47) 

n n 

and the fact that the cuts due to a continuum of extended states can be 
deformed away from the integration contour in (24.46) while the isolated 
poles due to localized states cannot be avoided. Therefore, 

(QRA(x)Q A R(y)) = \G(x,y;E + i5)\ 2 ~ ± e ~ 2 ^^ (24.48) 



where L is the system size. This factor comes from the spatial density of 
localized states / in (24.46). Hence, the correlation function is divergent 



CHAPTER 24. NON-LINEAR a-MODELS FOR DIFFUSING 
462 ELECTRONS AND ANDERSON LOCALIZATION 



for arbitrary separation as 5 — > 0. Fourier transforming, we see that it is 
divergent for all p: 

(Qra(p)Qar(-p)} ~ I 9{p) (24.49) 

where g(p) is a smooth function of p which is finite at p = 0. On the other 
hand, if E is in the regime of extended states, then we have diffusion: 

(Qra(p)Qar(-p)) ~ ^2^5 (2450) 

Comparing these two expressions, the localized phase has D — > 0, but not the 
restoration of symmetry or the dynamical generation of a mass. Compare 
this to the 0(3) NLerM which we studied in the context of antiferromag- 
netism. There, the correlation function in the disordered state is of the 
form (n(p)n(— p)) ~ l/(Dp 2 + m 2 ); a mass term is generated and the spin 
stiffness D (the analog of the diffusion constant) is irrelevant compared to 
it. 

Therefore, the (Qra(p)Qar(—p)) correlation function is divergent in 
both phases. However, it is divergent only at p = in the metallic phase, 
but divergent at all p in the insulating phase. In the latter case, there 
are no long-ranged correlations or long-wavelength excitations, but neither 
is there a gap - the density of states at the chemical potential is finite. 
This situation should be viewed as a generalization of Goldstone's theorem. 
When a symmetry is broken, there can be no mass gap because the order 
parameter is susceptible to infinitesimal perturbations. There are two ways 
in which this can happen: (a) the conventional way - the gapless states 
are Goldstone modes - or (b) they can be localized excitations. Thus, the 
basic symmetry between advanced and retarded Green functions - which 
differ only by the iS in their definition - is broken, so that infinitesimal 5 
leads to a finite difference between then. The symmetry is broken in both 
the metallic and insulating phases; the difference is that (a) occurs in the 
former while (b) occurs in the latter. 

24.3.3 RG Equations for the NLaM 

We will now compute the one-loop RG equations for the NLuMs introduced 
in this section, thereby recovering the Gang of Four's scaling equation. In 
order to do so, we need a convenient parametrization for Q. One simple one 
is: 

^ r 0)1/2 ^ 
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where V is an unconstrained n x n matrix. For Sp(2n) / Sp(n) x Sp(n), the 
elements of V are quaternions and f means take the transpose and perform 
the quaternion conjugate. For U(2n)/U(n) x U(n), the elements of V are 
complex numbers and f means the adjoint. For 0(2n)/0(n) x 0(n), the 
elements of V are real numbers and f means the transpose. 

In order to compute the one-loop /^-function, it is sufficient to expand this 
action to quartic order and, therefore, sufficient to expand Q to quadratic 
order in V. The result is simple to state. Let t = 1/D and let G be the 
symmetry group of the NLcrM. Then 

P(t) = et-Ct 2 + ... (24.52) 

where C is the Casimir (in the adjoint representation) of the symmetry 
group G. In particular 

0S P (2n)(t) = et- (2n + l)t 2 + ... 
Pu(n) (*) = et-nt 2 + ... 
Po(n)(t) = et - (n - 2)t 2 + ... 

(24.53) 

Note that the /3-function does not depend on the subgroup which is being 
modded out. Instead of Sp(2n)/Sp(n) x Sp(n), we could just as easily take 
Sp{2n) / Sp{p) x Sp(2n — p), and we would obtain the same result. 
For n — ► 0, this give us 

P(t) = et-t 2 + ... (24.54) 

for electrons in zero field, which is equivalent to the /3-function which we 
found in the previous chapter. In the presence of spin-orbit coupling, the 
/3-function has sign reversed, while in the presence of a weak magnetic field, 
the one-loop /3-function vanishes, and we must go to two loops. 

24.4 Interacting Electrons 

We now turn to the problem of interacting electrons. We can no longer 
consider individual frequencies in isolation. As a result, there is no longer 
an exact symmetry of the theory which we can exploit. Hence, there is no 
completely controlled route to an effective theory for diffusion modes. Such 
a theory must be some form of NL<rM in limit that the interaction strength 
vanishes. In the absence of a clear symmetry justification, we must rely 
on physical arguments: clearly the low-energy degrees of freedom should be 
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diffusion modes (at least for weak interactions), so we should write down an 
effective field theory which captures their physics. It turns out that the most 
straightforward way to do this is to construct a field theory with infinitely- 
many terms in its Lagrangian, all of whose coefficients are related. Unlike 
in the non-interacting case, we can only hope that the relationship between 
these terms is preserved. At one-loop order this holds. Furthermore, the 
theory contains terms which are completely non-local in time - such as the 
non-interacting electron NLuM (which is at a fixed energy and, therefore, 
is completely non-local in time) - and also terms which are local in time. 
In principle, terms which are only somewhat non-local in time could be 
generated. At one-loop, this, too, does not occur. 

We begin with a system of interacting electrons moving in a quenched 
random potential V(x) in 2D. For simplicity, we consider the unitary en- 
semble. The imaginary-time action is 

S = Sq + ^disorder + Sint (24.55) 

where 

So ~i~ ^disorder — 

T E / SX ViaW ( ie * + + » + ^A*) (24-56) 

and 

Sint = T J Fl V>n,a( k + q)^m,a(k)^(p - q)^+ m _ n ,/3(p) 

n,m,l 

+ T 2 4, >Q (k + q)^ m , a (p)^ t , /3 (P - q)Vi+ m -n,/3(k)] (24.57) 

For later convenience, we have written the second term in momentum space 
and have split the interaction into nearly forward scattering, in which elec- 
trons with momenta and spins (k, a), (p, 0) are scattered to (k',a), (p',/3), 
where k' ~ k, p' ~ p and exchange scattering, in which k' ~ p, p' ~ k. 
If the microscopic electron-electron interaction is V(c[), then the associated 
bare couping constants are Ti = V(0), T2 = J(d9/2i:) V(2kpcos9/2). As 
Finkelstein noted, these are subject to finite Fermi liquid renormalization in 
the ballistic regime, even before they begin to flow in the diffusive regime. 
We assume that the system is in a weak magnetic field, so that we can ignore 
Cooper scattering, for which ki « — k 2 , k3 f» — k4. 
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In order to perform the average over the quenched random potential, we 
replicate the theory and then average the replicated functional integral over 
the probability distribution V(x)V(x') = 27T n f t <^( x ~ x 0- Now, we have 

5-p = r£ [ d 2 x (^ + ^ v2 +^W(*) +i ^ (etc,.) 2 ] 

n;a J \ / * 

/Pi <L(k + q)<«( Wj(P " q)^f+™-„^(p) 

n,m,l;a 

+ T 2 <t a(k + q)^« )Q (p)^ t(p - q)^ +m _ n ,^(k)] (24.58) 

where a = l,2,...,iVisa replica index; eventually, we must take the replica 
limit, N — ► 0. iVp is the single-particle density-of-states at the Fermi en- 
ergy. The resulting effective action now contains four quartic terms: one 
disorder term and two interaction terms. Note that the disorder term is 
non-local in time since it only depends on two Matsubara frequencies, and 
it couples different replicas. The interaction terms depend on three Matsub- 
ara frequencies, so they are local in time, and they do not couple different 
replicas. 

We decouple these quartic terms with three Hubbard-Stratonovich fields, 
Qnm,^ Y nm = Y a (n-m), X* m ^ = X^(n-m). It will be useful to 
think of X, Y both as matrices with indices n, m and also as functions of 
a single frequency w n _ m ; they have only a single replica index, however, 
because interactions do not mix different replicas. Note that X a p ap a and 
Y are essentially the spin and charge density. Integrating out the fermions, 
we can write the partition function as 

Z = j VQVXVY e SefflQ,x,Y] (24.59) 
where S e g[Q,X,Y] is given by 

S eS [Q, X,Y] = Y,j [~tr In (ie n + ^V 2 + n + + iT 1 Y + T 2 X 

+ ^tr (Q 2 ) + 1 -T 1 tr (Y 2 ) + \ T 2 tr (X 2 )] (24.60) 

In this expression, we use 'tr' to mean the trace over all of the indices 
which are not explicitly written. Matsubara indices are treated as ordinary 
matrix indices except that their summations come with factors of T. 
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The saddle-point equations are 

irNpQ = i [ —„ (24.61) 

J (2tt) 2 i e ; + 1 i-v 2 +v + £Q + ir 1 Y + r 2 x 



1 



Sn+m,n + ^V 2 + fX + £Q + iT, Y + T 2 X 

(24.62) 

Y(m) = iTV / - 

V V (2vr) 2 ze„<5 n+m , n + ^V 2 + M + ^Qn+m,™ + ^ F + T 2 1 

(24.63) 

The momentum integrals on the right-hand-sides of (24.62) and (24.63) 
are simply iTTNpQ n+myn , according to (24.61). Hence, they have the solution 

Q± ja/3 = sgn(e n ) 5 mn 5 af3 5 ab 
Kp{m) = 

Y a (m) = (24.64) 

Note that the right-hand sides of the second and third equations of (24.62) 
and (24.63) vanish for m / because the saddle-point Q is diagonal in 
Matusubara frequency while they vanish for m = because of cancellation 
between positive and negative frequencies. As we will see later, for T 2 suf- 
ficiently large, X = becomes unstable and there is an X / solution of 
(24.62). This is the Stoner instability. 

The low-energy fluctuations about this saddle-point are transverse fluc- 
tuations of Q, which can be parametrized by V^ aj3 : 



vv^) 1 ' 2 V 

{i-vwy 



Q=\ r + ' ' 1/2 (24-65) 



The diagonal blocks of Q nrn correspond to Matsubara indices n > 0, m > 
and n < 0, m < while the upper right block corresponds to n < 0, m > 0; 
the lower left block, to n > 0,m < 0. These fluctuations correspond to 
diffusion of charge and spin. The longitudinal fluctuations of Q as well as 
fluctuations of X, Y are gapped at the classical level (i.e. tree-level). 

In order to derive an effective field theory for these diffusion modes, we 
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shift Q^Q- 2rri Y + 2riT 2 X in (24.60) 



S cS [Q,X,Y] = E/ t" tr ln ( ie « + + ^ + 27^) 

+ ^— -tr (Q 2 ) - TriV^tr (QY) + mN F T 2 tT (QX) 
4r 

+ 1^1 - 2tt ( rr E) ^ r2 ) r2tr (* 2 )1 ( 24 - 66 ) 



Formally, the factors of r (T£) m ) in the final two terms are infinite as 
a result of the unrestricted Matsubara sum. However, upon integrating out 
the massive longitudinal modes of Q, we see that they are the first terms in 
the series rT£ m - 2vr (rT£J 2 + . . . = rT^J (1 + 2tttT£J, which is 
simply 1/2-7T. Thus, we replace the factors of r (TJ2 m ) by 1/2tt in (24.66). 
Alternatively, we could expand the tr ln(. . .) about the diffusive saddle-point 
and keep only terms up to second order in X and Y. We would then obtain 
the same expressions, namely (24.66) with the factors of r (T^2 m ) replaced 
by 1/2-7T. The coefficient of X 2 is now 1 — NpF 2 , which becomes negative 
for T 2 > 1/Np- the Stoner instability. Finkelstein actually expanded the 
tr ln(. . .) only to linear order in X and Y. For small T±, T 2 , there is no 
difference, but the Stoner instabilty is missed. 

We now expand the tr ln(. . .) about the diffusive saddle-point. Thus, we 
require that Q is a constrained field of the form given in (24.65). 



S cS [Q] = ttN f J d d x {Dtr(VQ) 2 - AiZti (eQ) 



- Titr (QY) + ir 2 tr (QX) 

i r 2 



+ _(l-7V>r 2 ) -^-tr(X 2 )} (24.67) 



The diffusion constant D is given by D = v f t/2. 

Following Finkelstein, we integrate out X, Y and obtain an effective ac- 
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tion for the diffusion modes. 

S eS [Q] = nN F J d d x {Dtr(VQ) 2 - 4itr (eQ) 

— TfNFFiQ nin2iCea Q n3n4 ^<5 ni _ n2 + n3 _ n4 

+ nN F r 2 Qn a i n 2a /3Qn a 3 n4l3a S n 1 +n2-n3-n4} (24.68) 

Note that ordinary matrix multiplication rules reflect the non-locality in 
time of the first term (the 'disorder term') in this action. The interac- 
tion terms do not involve matrix multiplication and are, consequently, local 
in time. In this expression, = I\2/(l ± NpTi^)- These corrections 
to follow from our retention of the X 2 and Y 2 terms which Finkel- 
stein drops. For small, = I\2, and Finkelstein's effective action is 
recovered. 

Initially, the coefficient of the tr (eQ) term is 1, as in (24.67), but quan- 
tum corrections cause it to flow, so we have followed Finkelstein in intro- 
ducing the coupling Z. 

24.5 The Metal-Insulator Transition 



24.6 Mesoscopic fluctuations*** 



